I INTRODUCTION AND SUMMARY

A.  Background 

The state of the sky is typically described by a few integrated
parameters, such as the total sky cover, the cloud amount in
layers, and the ceiling height. This summarizes what might be quite
complicated phenomena, only small portions of which are sampled by
ceilometers and other instruments. The present procedure for reporting
the state of the sky relies heavily on human observers to make visual
observations and interpret instrument readings. To automate this system,
one must be able to specify the required instrument characteristics
and the procedures for processing data to relate instrument readings
to actual sky conditions.

In an earlier phase of this contract, we developed a mathematical
cloud model as a tool for solving these problems (Duda, Mancuso
and Blackmer, 1970). This model describes a sky composed of several
layers of clouds at different mean base heights, each layer moving at
a specified speed in a specified direction. The clouds in each layer
are described by specifying a number of parameters, such as the mean
length and percent sky cover. By varying these parameters, a wide
variety of sky conditions can be obtained. This model was implemented
as a digital computer program that calculates the responses of
vertically-pointing ceilometers to the model sky conditions. By using this
simulation program, the behavior of an automatic system for describing the
state of the sky can be determined.

B.  Objectives 

The basic objectives of the work described in this report
were to gain insight into the cost/effectiveness of various sampling
and processing strategies, and to compare the performance of various
ceilometers, both to evaluate current sensors and to establish
specifications for future instruments. The investigation involved three
major tasks:

(1)  Sampling  and  processing  studies, 

(2)  Ceilometer  studies,  and 

(3) Feasibility and cost comparisons.

In Task 1, the emphasis was on determining the spatial and
temporal sampling and the data processing necessary to describe cloud
amount and ceiling height to a specified degree of accuracy. Most of
this work involved ideal or error-free instruments, although the effects
of instrument limitations were also considered. The purpose of Task 2
was to evaluate the ability of vertically-pointing and scanning
instruments to measure cloud amount and ceiling height, and to determine
instrument characteristics needed to meet given operating specifications.
Task 3 involved the use of the simulation program to determine the
technical feasibility of an automatic system. It also included a cost
comparison with a nonautomated system to determine economic feasibility.

The remainder of this report presents the results of our
research on these three tasks. The organization of the report is based
on the natural division of the problem into the separate problems of
cloud-amount estimation, cloud base-height estimation, and cost
comparisons. In the remainder of this section we summarize the major results
of this work and show how they relate to the three major tasks.

C.  Task  1 — Sampling  and  Processing  Studies 

The  basic  sampling  problem  is  to  determine  the  spatial  and 
temporal  sampling  required  to  determine  cloud  amount  and  cloud  base 
height  to  a  specified  degree  of  accuracy.  Since  the  determination  of 
cloud  amount  or  cloud  base  height  required  processing  the  data  obtained 
from  sampling,  sampling  and  processing  were  often  considered  together. 
The  primary  processing  technique  for  cloud  amount  estimation  was  the 
time averaging of readings from the various instruments. A greater
variety of techniques was considered for cloud height estimation,
including techniques for probability density function estimation,
mixture density analysis, and clustering.

Most of our studies involved ideal instruments that gave
exact reports of the conditions they sampled. The sampling error is due
to the fact that not all points in space and time are sampled. In
addition, intervening lower layers can interrupt the sampling of upper layers.
A combination of mathematical analysis and computer simulation was used
to show how the error depended on the following factors:


(1)  Sampling  parameters 

(a)  Number  of  sensors 

(b)  Area  covered 

(c)  Sensor  configuration 

(d)  Time  between  samples 

(2)  Processing  parameters 

(a)  Averaging  time 

(b)  Window  widths 

(c)  Number  of  cluster  centers 

(3)  Cloud  parameters 

(a)  Mean  cloud  length 

(b)  Cloud  amount 

(c)  Cloud  speed 

(d)  Mean  base  height 

(e)  Base-height  standard  deviation 

(f)  Base-height  correlation  distance 


The major results of this study can be summarized as follows.
The root mean square error $\sigma$ in estimating cloud amount was found to
depend primarily upon $\sqrt{A}/\ell_m$, where A is the area covered and
$\ell_m$ is the mean cloud length. Figure 6 shows the number of instruments
needed to obtain a specified $\sigma$ for a worst case situation with
$c_a = 0.5$ and no time averaging. By optimum time averaging, the required
number of instruments can be reduced greatly, with four vertically-pointing
instruments theoretically being able to exceed human performance.

The root mean square error in estimating the base height
of a single cloud layer depends primarily upon $\sigma_b$ and $vT/d$, and
there exist theoretically optimum procedures for sampling and processing.
The multiple-layer case is much more difficult. In our experiments,
good results were obtained with a simple hierarchical clustering
procedure described in Section IV-D-2. This procedure was able to
determine the number of cloud layers, the mean height of each layer,
and the cloud amount in layers for a variety of experimental conditions.
However, these results may be sensitive to instrument limitations, and an
investigation of ways to detect and overcome the effects of interference
and precipitation is needed. A discussion of these problems concludes
Section IV.

D.  Task  2— Ceilometer  Studies 

The goal of Task 2 was to evaluate the performance of both
vertically-pointing and scanning instruments used to measure cloud
amount and ceiling height, and to relate performance to required
instrument characteristics. In this work, the model was used to provide
information about the true sky conditions, allowing comparisons against an
objective standard.

The performance of vertically-pointing instruments is directly
related to the root mean square error $\sigma$. A major result of our study
was to relate $\sigma$ to the time $\Delta T$ between measurements. Even if
$\Delta T$ is zero, some error is unavoidable. The percentage increase in
$\sigma$ as a function of $\Delta T$ was determined, and a criterion for
selecting $\Delta T$ is suggested in Section III-C-2. For cloud amount
estimation, this criterion depends on the cloud speed v and the mean cloud
length $\ell_m$. For cloud base height estimation, it depends on the cloud
speed and the base height correlation distance d. Actual selection of
instrument parameters requires a decision about the range of cloud
conditions over which a desired performance is to be achieved; such
decisions were considered to be outside the scope of this study.

Our original simulation model did not include provisions for
scanning instruments. To add these instruments to our study, we limited
our attention to a single cloud layer and worked in terms of an equivalent
system of vertically-pointing instruments. The chief advantage of
scanning instruments is that for intermediate and high clouds, one
scanning instrument can replace many vertically-pointing instruments.
However, it was found that the performance of scanning instruments was
limited at low ceiling heights by geometrical constraints (see Section
III-E). In addition to sampling problems, there are particular problems
associated with the physics of different kinds of instruments. For
example, the performance of scanning radars depends on the wavelength
and pulse length used, and the usefulness of scanning lidars might be
limited by safety considerations. Appendix A contains a discussion of
these considerations for radar systems, and Appendix Q contains a
similar discussion for lidar systems. The results of our sampling
study also apply to other types of scanning sensors, such as the
passive infra-red sensor, but only the radar and lidar systems were
considered in detail.

E.  Task  3 — Feasibility  and  Cost  Comparisons 

The feasibility of an automatic system depends upon many
factors, including the performance required and the nature of the
conditions at the particular airport. To the extent that our model accurately
describes sky conditions and instrument characteristics, we have
demonstrated the technical feasibility of an automatic system for measuring
cloud amount at least as accurately as is done at present. The technical
feasibility of using clustering techniques to automatically determine
cloud base heights was also demonstrated, both with the model data and
with a small sample of actual ceilometer data. However, some further
development of these techniques will probably be needed to accommodate
possible bad effects of interference and precipitation.

Section  V  contains  a  comparison  of  the  cost  of  an  automatic 
system  and  the  cost  of  the  present  manual  system.  In  this  comparison, 
it  was  assumed  that  both  systems  gave  equivalent  performance,  and  other 
possible benefits of either system were not considered. It was found
that the results depended on the frequency with which cloud amount and
ceiling  height  reports  are  needed.  If  the  present  frequency  of  one 
reading  per  hour  is  satisfactory,  then  the  current  manual  system  is 
less  expensive.  However,  if  readings  are  required  every  ten  minutes 
or  more  often,  automatic  systems  can  be  less  expensive.  The  effects  of 
cost  uncertainties  are  included  in  this  analysis. 

F.  Report  Organization 

The remainder of this report contains the technical material
that justifies these conclusions. Section II reviews the cloud model.
It describes the parameters involved in our study and their effects on
the simulated sky conditions. Section III is concerned with cloud-amount
estimation. It includes precise definitions of terms, and separate
studies of estimation by single vertically-pointing instruments and
scanning instruments. Section IV is concerned with cloud base-height
estimation. It defines base heights for multiple-layer clouds in terms
of the parameters in a mixture density function. The results of various
techniques for estimating density functions and clustering cloud-height
readings are reported. Section V shows how these theoretical results
can be used to determine instrument and data processing requirements.
Section VI presents the cost comparison, and Section VII presents the
conclusions of the study.
II THE CLOUD MODEL

A. Introduction

In this section we give a brief review of those aspects of
the cloud model that one must know to understand the rest of this
report; a complete description of the cloud model is given in the
report by Duda, Mancuso and Blackmer (1970). The cloud model describes
a sky composed of one or more layers of clouds. The clouds in each
layer have random characteristics, but their statistical properties are
fixed by a small number of parameters. Since these parameters have
constant values everywhere in a given layer, each layer is statistically
homogeneous.

The geometrical parameters determine the sizes and shapes of
the clouds. They fall into two classes, those that affect the plan
view and those that affect the profile view. In plan view, a layer
looks like an infinite plane covered with a scattering of possibly
overlapping rectangles of different sizes. If the cloud amount is five
tenths or less, these rectangles represent clouds, while if the cloud
amount exceeds five tenths they represent holes. To simplify the following
discussion, we shall assume that the rectangular areas are clouds.
If the cloud amount exceeds five tenths, the statements made about
clouds should be interpreted as pertaining to holes.

B. Plan-View Parameters

The appearance of the clouds in plan view is affected by the
following parameters:

(1) Mean length $\ell_m$ (m)*

(2) Aspect ratio $\rho$ (-)

(3) Cloud amount $c_a$ (-)

(4) Cloud speed $v$ (m/s)

(5) Cloud direction $\psi_c$ (rad)

(6) Layer beginning time $t_b$ (s)

(7) Layer ending time $t_e$ (s)

* In some cases distances will be reported in feet rather than meters.

The clouds generated by the model are rectangles of length
$\ell$ in the direction of motion and width $\rho\ell$ normal to the
direction of motion. The lengths $\ell$ have an exponential distribution
that is completely specified by the mean length $\ell_m$. The cloud amount
$c_a$ specifies the fraction of the entire infinite plane that is covered
with clouds. The cloud amount over some specified area A may be either
less than or greater than $c_a$, and will vary as the clouds move. All of
the clouds in a given layer move in unison with the common speed v in a
common direction specified by $\psi_c$. This greatly simplifies the
computation of the instrument responses as a function of time. The
instrument locations are given in a conventional, earth-referenced
xy-coordinate system. If one wants to consider only part of the infinite
cloud layer, a strip of clouds can be defined by $t_b$ and $t_e$. The
layer-beginning time $t_b$ specifies the time at which the leading edge of
the layer passes over the origin, and the layer-ending time $t_e$ does the
same thing for the trailing edge.
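
As a rough illustration of the plan-view description, the following Python
sketch draws cloud lengths from an exponential distribution with a given
mean length and forms rectangles whose width is the aspect ratio times the
length. The function names and the fixed random seed are illustrative
assumptions and are not taken from the original simulation program; the
placement of the rectangles, and hence the resulting cloud amount, is not
modeled here.

```python
import random


def sample_cloud_lengths(mean_length, count, seed=0):
    """Draw `count` cloud lengths from an exponential distribution.

    random.expovariate takes the rate parameter, i.e. 1 / mean.
    """
    rng = random.Random(seed)
    return [rng.expovariate(1.0 / mean_length) for _ in range(count)]


def cloud_rectangles(mean_length, aspect_ratio, count, seed=0):
    """Return (length, width) pairs with width = aspect_ratio * length."""
    lengths = sample_cloud_lengths(mean_length, count, seed)
    return [(length, aspect_ratio * length) for length in lengths]


if __name__ == "__main__":
    rectangles = cloud_rectangles(mean_length=200.0, aspect_ratio=1.0, count=5)
    for length, width in rectangles:
        print(f"length = {length:8.1f}   width = {width:8.1f}")
```

Because the distribution is fixed by its mean alone, the sample mean of a
large number of draws should approach the specified mean length.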

C.  Profile-View  Parameters 

The appearance of the clouds in profile view is affected by the
following parameters:

(1) Mean thickness $t$ (m)

(2) Mean base height $\bar{h}$ (m)

(3) Base-height standard deviation $\sigma_b$ (m)

(4) Base-height correlation distance $d$ (m)

The tops of the clouds are always located at $\bar{h} + t$. The base
height h(x) is a randomly generated function of x, having mean $\bar{h}$
and standard deviation $\sigma_b$. The correlation coefficient between two
base heights a distance $\xi$ apart is given by $\exp[-\xi/d]$, where d is
the base-height correlation distance. Thus, adjacent base-height values are
correlated, but the correlation is very small between points that are
separated by distances much greater than d. As the clouds move, the
base-height profile moves with them. Thus, if the x-axis is aligned
with the direction of cloud motion, the base height function should
really be written as h(x + vt), so that h(x) gives the base height at
time t = 0.
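
One simple way to generate a profile with an exponential correlation of
this kind is a first-order autoregressive sequence on a regular grid. The
sketch below does this in Python. It assumes Gaussian base-height values
and a fixed grid spacing; the report does not state how h(x) is generated,
so both choices, as well as the function names, are assumptions made for
illustration.

```python
import math
import random


def base_height_profile(mean_height, sigma_b, corr_distance, step, count, seed=0):
    """Base heights on a grid of spacing `step` (same units as corr_distance).

    Neighbouring values have correlation exp(-step / corr_distance), so
    values k steps apart have correlation exp(-k * step / corr_distance).
    """
    rng = random.Random(seed)
    rho = math.exp(-step / corr_distance)
    innovation_sd = sigma_b * math.sqrt(1.0 - rho * rho)
    deviation = rng.gauss(0.0, sigma_b)  # start in the stationary state
    heights = []
    for _ in range(count):
        heights.append(mean_height + deviation)
        deviation = rho * deviation + rng.gauss(0.0, innovation_sd)
    return heights


def sample_correlation(values, lag):
    """Sample correlation coefficient between values `lag` grid steps apart."""
    mean = sum(values) / len(values)
    variance = sum((v - mean) ** 2 for v in values) / len(values)
    pairs = len(values) - lag
    covariance = sum(
        (values[i] - mean) * (values[i + lag] - mean) for i in range(pairs)
    ) / pairs
    return covariance / variance


if __name__ == "__main__":
    heights = base_height_profile(1000.0, 100.0, 500.0, 100.0, 200000)
    # Five steps of 100 m equal one correlation distance of 500 m.
    print(sample_correlation(heights, 5), math.exp(-1.0))
```

At a separation equal to the correlation distance the sample correlation
should be close to exp(-1), and it should fall toward zero at separations
much greater than d.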
D. Instrument Parameters and Normalization


Most of this report is concerned with the response of ideal
instruments to sky conditions that can be generated by the cloud model.
By an ideal instrument, we mean one that sights along a straight line,
does not give a response when no clouds are intercepted, and gives the
exact range to the first point of intersection when a cloud is
intercepted. With an ideal vertically-pointing instrument, the line of sight
is to the zenith. We do not assume that the base height readings are
obtained continually, but rather that readings are made every $\Delta T$
seconds. For an ideal vertically-pointing instrument, this is the only
instrument parameter of interest.

In general, an ideal vertically-pointing instrument provides
samples of the base-height function h(x). If we assume for simplicity
that the instrument is located at the origin and that the x-axis is
aligned with the direction of cloud motion, then at the kth sampling
instant we obtain the reading $h(kv\Delta T)$. This shows that not all of
the parameters are independent variables. If v were doubled, $\Delta T$
were halved, and all other conditions were left the same, exactly the same
sequence of readings would be obtained. By normalizing the dimensional
parameters with respect to distance and time, we can reduce the number of
variables and simplify the presentation of results. While normalization
can be done in many ways, we most frequently will use the mean cloud length
$\ell_m$ to normalize distances and the sampling interval $\Delta T$ to
normalize time. Thus, the normalized correlation distance is $d/\ell_m$,
and the normalized cloud speed is $v\Delta T/\ell_m$.*

* This normalization is most convenient in the single-layer case. With
multiple layers, one can use the mean cloud length in the lowest layer
for normalization, but this is somewhat inconvenient. In such cases
we occasionally use the square root of the airport area to normalize
distances.
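
The dependence between the cloud speed and the sampling interval can be
checked with a few lines of Python. The sketch below lists the positions
along the cloud layer seen by an instrument at the origin and computes the
two normalized quantities named in the text. The function names are
hypothetical and the numerical values are chosen only for illustration.

```python
def sample_positions(speed, interval, count):
    """Positions k * v * dT along the cloud layer seen at the origin."""
    return [k * speed * interval for k in range(count)]


def normalized_parameters(mean_length, corr_distance, speed, interval):
    """Normalize distances by the mean cloud length and time by dT."""
    return {
        "correlation_distance": corr_distance / mean_length,
        "cloud_speed": speed * interval / mean_length,
    }


if __name__ == "__main__":
    # Doubling the speed and halving the interval leaves the sampled
    # positions, and therefore the readings, unchanged.
    print(sample_positions(10.0, 4.0, 5))
    print(sample_positions(20.0, 2.0, 5))
    print(normalized_parameters(200.0, 500.0, 10.0, 5.0))
```

Only the product of speed and sampling interval enters the sampled
positions, which is why the normalized cloud speed is a sufficient
description of both.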
III CLOUD AMOUNT ESTIMATION

A. Definitions

When sky cover is estimated by human observers, it is
reported as tenths or eighths of the sky covered by surface-based
phenomena and by clouds or obscuring phenomena in each layer aloft.
The evaluation is made in terms of the entire sky area above the local
or apparent horizon, and is fundamentally a surface-based observation,
limited to a roughly circular area of several miles radius centered
about the observer.

When instruments are to give an objective measure of
cloud amount, it is necessary to be somewhat more precise in defining
cloud amount, particularly with regard to the area involved. A single
vertically-pointing instrument gives an excellent report of the cloud
amount directly above that instrument, but may not give a correct
report of the cloud amount in a circular area of, say, five-mile radius
around that instrument. In fact, a human observer has a similar
limitation, but since he can usually observe the entire area of interest for
operations at one airport, this limitation is usually not noticed.

In this report, we define the cloud amount over a given area
A as the fraction of that area covered by a vertical projection of the
clouds onto the ground. When several layers are present, this gives
the total cloud amount. In describing the cloud amount in layers, one
must distinguish between the cloud amount actually observable from the
ground and the cloud amount for an isolated layer. In accordance with
conventional practice, we define the (observable) cloud amount for a
given layer as the fraction of the area that is covered by clouds in
that layer and not covered by clouds in lower layers. We also define
the model cloud amount as the fraction of the area that is covered by
clouds in that layer in the absence of other layers. In this section
we shall only be concerned with estimating the total cloud amount; we
shall return to the question of measuring the (observable) cloud amount
in layers in Section IV.

B. Variables

When instruments are used to estimate the cloud amount over
an area A, the results obtained depend on the following quantities:

(1) The area $A$ (m$^2$)

(2) The number of instruments $n$ (-)

(3) The time interval between readings $\Delta T$ (s)

(4) The total observation time $T$ (s)

(5) The plan-view parameters:

    Mean length $\ell_m$ (m)

    Aspect ratio $\rho$ (-)

    Cloud amount $c_a$ (-)

    Cloud speed $v$ (m/s)

    Cloud direction $\psi_c$ (rad)

    Layer duration $t_e - t_b$ (s)

Even when we normalize by $\ell_m$ and $\Delta T$, we see that the
performance depends on at least eight variables. It is not possible to
consider all combinations of all of these variables in an experimental
program to find the optimum sampling and processing procedures. In our
investigations, we paid particular attention to what we thought were the
most critical variables. In particular, we ignored the effects of the
aspect ratio and the cloud direction, taking $\rho = 1$ and $\psi_c = 0$.
Since our initial experiments disclosed that five-tenths cloud amount
provided the greatest difficulty, we limited most of our experiments to
this case. Finally, because of experimental difficulty, we considered only
one case in which the layer duration was not infinite. This means that we
studied mainly statistically uniform situations. We have tried to
insert cautionary remarks whenever neglect of this variable might result
in conclusions that are not valid in general.

In the remainder of this section we show how cloud amount
estimates depend on the remaining variables. We consider first the
performance of one instrument and show how it can be improved by time
averaging. After that we consider the case where n instruments are
used to give instantaneous ("snapshot") estimates of the cloud amount
and show how the performance varies with n, A, and $\ell_m$. Finally, we
consider the additional improvement that can be obtained by averaging, and
introduce the effects of $\Delta T$, T, and v.
C. Single-Instrument Sampling

1. Without Time Averaging

An ideal vertically-pointing instrument can detect whether
or not there is a cloud directly overhead. Let c(x,y) indicate the
presence or absence of a cloud above a given point (x,y), with c = 1 if
a cloud is present and c = 0 if a cloud is absent. Let $\bar{c}$ give the
total cloud amount over a specified area A, and let $c_a$ be the total
cloud amount for the cloud layer. Then

$$\bar{c} = \frac{1}{A} \iint_A c(x,y)\,dx\,dy \tag{1}$$

and

$$c_a = \lim_{A \to \infty} \bar{c} \tag{2}$$

If a single instrument is located at $(x_0, y_0)$, then its
readings give $c(x_0, y_0)$. We want to use these readings to estimate
$\bar{c}$. Let $\hat{c}$ be our estimate of $\bar{c}$. The simplest of all
estimates is just to take $\hat{c} = c$, the instantaneous instrument
reading. If the area A is small (relative to the square of the mean cloud
length), then this estimate is very good. However, if A is large, this
estimate is poor. The accuracy of the estimate is measured by the error
$\hat{c} - \bar{c}$, or, since $c$, $\bar{c}$ and $\hat{c}$ are random
variables, by the expected squared error*

$$\sigma^2 = E[(\hat{c} - \bar{c})^2] \tag{3}$$

$$\phantom{\sigma^2} = E[\hat{c}^2] - 2E[\hat{c}\,\bar{c}] + E[\bar{c}^2] \tag{4}$$


* Note that $\sigma^2$ is not the variance of $\hat{c}$ unless the area is
infinite, in which case $\bar{c} = c_a = E[\hat{c}]$.

Under ordinary conditions, the probability of error for
the estimate $\hat{c}$ can be related to the root-mean-square error
$\sigma$ by assuming that $\hat{c}$ is normally distributed with mean
$\bar{c}$ and variance $\sigma^2$. In this case, the probability that
$\hat{c}$ lies in the interval $\bar{c} - \Delta \le \hat{c} \le \bar{c} + \Delta$
is given by

$$P = \frac{1}{\sqrt{2\pi}\,\sigma} \int_{\bar{c}-\Delta}^{\bar{c}+\Delta} \exp\left[-\frac{1}{2}\left(\frac{\hat{c}-\bar{c}}{\sigma}\right)^2\right] d\hat{c} \tag{5}$$

This probability is shown graphically in Figure 1.

Note that $\sigma$ must be rather small to obtain good accuracy. For
example, if one wants the estimate to be within one tenth of the true cloud
amount at least 68 percent of the time, $\sigma$ must be less than 0.1. For
this same performance 95 percent of the time, $\sigma$ must be less than
0.05. According to Galligan, when the cloud amount is near five tenths,
human observers have a $\sigma$ from 0.107 to 0.123 (Galligan, 1953). This
should be kept in mind when specifying the desired performance for an
automatic system.
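
Under the normal-distribution assumption, the integral in Eq. (5) reduces
to an error function, which makes the quoted accuracy levels easy to check.
The short Python sketch below evaluates it; the function name
`probability_within` is an illustrative choice, not a name from the report.

```python
import math


def probability_within(delta, sigma):
    """Eq. (5): probability that a normal estimate with RMS error sigma
    falls within +/- delta of the true cloud amount."""
    return math.erf(delta / (sigma * math.sqrt(2.0)))


if __name__ == "__main__":
    for sigma in (0.05, 0.1, 0.107, 0.123):
        p = probability_within(0.1, sigma)
        print(f"sigma = {sigma:5.3f}   P(within one tenth) = {p:.3f}")
```

With a tolerance of one tenth, an RMS error of 0.1 corresponds to one
standard deviation (a probability of about 0.68) and an RMS error of 0.05
to two standard deviations (about 0.95).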

In Section III-D-2 we shall see how $\sigma$ increases as a
function of A. At this point, however, we consider only the limiting
situation as A approaches infinity. For this case, $\bar{c} \to c_a$, and

$$\lim_{A \to \infty} \sigma^2 = E[\hat{c}^2] - 2c_a E[\hat{c}] + c_a^2 \tag{6}$$

Since $\hat{c}$ is either one or zero, being one with probability $c_a$ and
zero with probability $1 - c_a$,

$$E[\hat{c}] = E[\hat{c}^2] = c_a \tag{7}$$

Thus,

$$\lim_{A \to \infty} \sigma^2 = c_a (1 - c_a) \tag{8}$$


FIGURE 1 PROBABILITY THAT THE ESTIMATED CLOUD AMOUNT IS
WITHIN $\pm\Delta$ OF THE TRUE CLOUD AMOUNT

This shows that the accuracy of the estimate varies
with the cloud amount. It is very good if $c_a$ is near either zero or
one, and is worst if $c_a = 0.5$. This is a very reasonable result, and
is in agreement with observations on the accuracy of human observers
in estimating cloud amount (Galligan, 1953). Since the worst case
occurs for $c_a = 0.5$, and since this is not an anomalous situation,
we have used this result to justify limiting most of our experimental
work to this case.
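
The limiting result of Eq. (8) can be checked numerically. The Python
sketch below compares the closed form with a simple Monte Carlo estimate in
which each instantaneous reading is one with probability equal to the layer
cloud amount. The function names, the trial count, and the random seed are
assumptions made for illustration only.

```python
import random


def limiting_squared_error(cloud_amount):
    """Eq. (8): expected squared error of a single instantaneous reading
    when the area is infinite."""
    return cloud_amount * (1.0 - cloud_amount)


def simulated_squared_error(cloud_amount, trials=100000, seed=0):
    """Monte Carlo estimate of the same quantity from 0/1 readings."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        reading = 1.0 if rng.random() < cloud_amount else 0.0
        total += (reading - cloud_amount) ** 2
    return total / trials


if __name__ == "__main__":
    for amount in (0.1, 0.3, 0.5, 0.7, 0.9):
        print(amount, limiting_squared_error(amount),
              simulated_squared_error(amount))
```

At five tenths cloud amount the expected squared error is 0.25, so the
root-mean-square error of a single unaveraged reading is 0.5.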

2. With Time Averaging

The estimate $\hat{c} = c$ takes no account of the spatial and
temporal continuity of the cloud cover. It jumps back and forth between
$\hat{c} = 0$ and $\hat{c} = 1$, and would not be considered seriously in
practice. A more reasonable estimate can be obtained by smoothing or
averaging the readings taken at previous times. Suppose that readings are
taken every $\Delta T$ seconds. If $c_k$ denotes the reading obtained at
the kth sampling time, then the estimate of $\bar{c}$ obtained by averaging
m readings is given by

$$\hat{c}_m = \frac{1}{m} \sum_{k=1}^{m} c_k \tag{9}$$


The physical interpretation of this estimate is simple.
If the clouds are moving, then the instrument is sampling the clouds
along a line in plan view that goes through the instrument location and
along the direction of cloud motion. If the cloud speed is v, then the
samples are a distance $v\Delta T$ apart in space, and the samples used to
compute $\hat{c}_m$ extend over a distance of $mv\Delta T = vT$, where T is
the total averaging time. Note that if we keep T fixed and decrease
$\Delta T$, then $m = T/\Delta T$ will increase. In the limit as $\Delta T$
goes to zero, $\hat{c}_m$ approaches the average of c(x,y) along the line
of length vT. This line is as close as a single instrument can come to
obtaining $\bar{c}$, the average of c(x,y) over the entire area A.
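
The effect of the averaging in Eq. (9) can be illustrated with a small
simulation. The Python sketch below generates readings from a single
instrument as clouds and gaps pass overhead and then averages them in
blocks of m readings. It assumes a one-dimensional cut through the layer in
which cloud lengths and gap lengths are independent and exponentially
distributed; this alternating model ignores cloud overlap and is a
simplification chosen for illustration, not the full cloud model. All
function names are hypothetical.

```python
import random


def simulate_readings(mean_cloud_length, mean_gap_length, speed, interval,
                      count, seed=0):
    """Readings c_k (1 = cloud overhead, 0 = clear) at positions k * v * dT."""
    rng = random.Random(seed)
    cloud_fraction = mean_cloud_length / (mean_cloud_length + mean_gap_length)
    in_cloud = rng.random() < cloud_fraction
    # Exponential lengths are memoryless, so the first (partial) segment
    # can be drawn from the same distribution as a complete one.
    mean = mean_cloud_length if in_cloud else mean_gap_length
    segment_end = rng.expovariate(1.0 / mean)
    readings = []
    for k in range(count):
        position = k * speed * interval
        while position >= segment_end:
            in_cloud = not in_cloud
            mean = mean_cloud_length if in_cloud else mean_gap_length
            segment_end += rng.expovariate(1.0 / mean)
        readings.append(1 if in_cloud else 0)
    return readings


def time_averaged_estimates(readings, m):
    """Eq. (9) applied to successive non-overlapping blocks of m readings."""
    return [sum(readings[i:i + m]) / m
            for i in range(0, len(readings) - m + 1, m)]


def mean_squared_error(estimates, true_amount):
    return sum((e - true_amount) ** 2 for e in estimates) / len(estimates)


if __name__ == "__main__":
    # Five tenths cloud amount: clouds and gaps both average 200 ft.
    readings = simulate_readings(200.0, 200.0, speed=10.0, interval=5.0,
                                 count=40000)
    print("no averaging :", mean_squared_error(readings, 0.5))
    print("m = 12 (60 s):",
          mean_squared_error(time_averaged_estimates(readings, 12), 0.5))
```

With no averaging every reading is in error by exactly one half, so the
squared error is 0.25; averaging over a path several mean cloud lengths
long reduces it substantially.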

Appendix C gives a derivation of the expected squared
error $\hat{\sigma}_m^2$ for the estimate $\hat{c}_m$. The analysis is
limited to the special case where $A = \infty$ and $c_a = 0.5$, and
neglects the effects of cloud overlap. Its chief significance is that it
provides us with a criterion for determining the sampling interval
$\Delta T$. The results of that analysis show that if the averaging time T
is fixed and if $m = T/\Delta T$ is increased by letting $\Delta T$
approach zero, then the expected squared error decreases
to a minimum value $\hat{\sigma}_{\min}^2$ given by


$$\hat{\sigma}_{\min}^2 = \frac{e^{-2vT/\ell_m} - 1 + 2vT/\ell_m}{8\,(vT/\ell_m)^2} \tag{10}$$

The variation of $\hat{\sigma}_{\min}^2$ with $vT/\ell_m$ is shown in
Figure 2. If $\Delta T$ is not zero, $\hat{\sigma}_m^2$ is greater than
$\hat{\sigma}_{\min}^2$, but it may not be much greater. The exact increase
in $\hat{\sigma}_m^2$ depends on both $v\Delta T/\ell_m$ and $vT/\ell_m$,
and the details are discussed in Appendix C. The basic result of this
analysis is that the percentage increase in $\hat{\sigma}_m^2$ must lie
between the two bounds shown in Figure 3. The upper bound corresponds to
the no-averaging case where $T = \Delta T$, and the lower bound corresponds
to the infinite-averaging case where $T = \infty$.

Note that in the worst case $\hat{\sigma}_m^2$ is less than 20%
greater than $\hat{\sigma}_{\min}^2$ if $\Delta T = 0.25\,\ell_m/v$, and a
further reduction in $\Delta T$ does not yield much further benefit. Thus,
for example, if the mean cloud length is 200 ft and the cloud speed is
10 ft/sec, there is little value in sampling faster than once every five
seconds. A very similar analysis shows that the percentage increase in the
variance of the base height estimate is small if $\Delta T = 0.5\,d/v$,
where d is the correlation distance (see Appendix E). We suggest the use of
this criterion of percentage increase in the minimum expected squared error
for determining the required sampling interval $\Delta T$.
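
The two sampling-interval criteria are simple enough to express as helper
functions. The Python sketch below uses the factors 0.25 and 0.5 as
defaults; the function names are hypothetical, and the 500 ft correlation
distance in the usage example is an arbitrary illustrative value.

```python
def cloud_amount_sampling_interval(mean_cloud_length, cloud_speed, factor=0.25):
    """Suggested interval between readings for cloud amount estimation:
    dT = factor * (mean cloud length) / (cloud speed)."""
    return factor * mean_cloud_length / cloud_speed


def base_height_sampling_interval(corr_distance, cloud_speed, factor=0.5):
    """Suggested interval between readings for base height estimation:
    dT = factor * (correlation distance) / (cloud speed)."""
    return factor * corr_distance / cloud_speed


if __name__ == "__main__":
    # Worked example: 200 ft mean cloud length, 10 ft/sec cloud speed.
    print(cloud_amount_sampling_interval(200.0, 10.0))  # seconds
    # Illustrative value only: 500 ft correlation distance.
    print(base_height_sampling_interval(500.0, 10.0))   # seconds
```

Lengths and speeds must be given in consistent units (for example feet and
feet per second) for the result to be in seconds.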

It might appear that Eq. (10) also supplies a criterion
for determining the averaging time T. Unfortunately, an uncritical use
of this result suggests making the averaging time infinite to obtain
$\hat{\sigma}_{\min}^2 = 0$. This is a limitation of the analysis, which
was for the special case $A = \infty$. When the area is finite, the use of
a long averaging time will prevent the estimate $\hat{c}_m$ from being able
to follow fluctuations in $\bar{c}$, the cloud amount over the area. In
general, the averaging time is also the time that is required for
$\hat{c}_m$ to be able to respond to a systematic change in the cloud
amount. This, not Eq. (10), is the criterion
that should be used in selecting T. We shall return to this question when
we examine the results of averaging the responses of several instruments.
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FIGURE  2  DEPENDENCE  OF  THE  MINIMUM  EXPECTED  SQUARED  ERROR  ON  AVERAGING  TIME 


FRACTIONAL  INCREASE  IN  EXPECTED  SQUARED 


D.  Multiple-Instrument Sampling
1.  Spatial Sampling

The use of more than one instrument allows the sampling of cloud conditions at various points in the area A. In particular, if an infinite number of instruments are used, c(x,y) can be determined at every point in the area. Thus, the average cloud amount $\bar{c}$ can be determined exactly, and zero expected squared error can be obtained. This is in contrast with the case of a single instrument, which in theory cannot achieve an expected squared error less than $\hat{\sigma}_{\min}^2$.

In  considering  the  use  of  more  than  one  instrument,  one 
of  the  first  questions  that  arises  concerns  the  instrument  locations. 
This  is  the  problem  of  spatial  sampling.  Intuitively,  it  is  clear  that 
the  instruments  should  not  be  concentrated  in  one  locality,  but  should 
be  distributed  over  the  area  to  obtain  more  independent  information. 
Whether  or  not  the  distribution  should  be  isotropic  depends  on  prior 
information.  For  example,  if  at  a  particular  airport  the  clouds  almost 
always  come  from  a  particular  direction,  it  seems  clear  that  the  instru¬ 
ments  should  be  arranged  along  a  line  at  right  angles  to  that  direction. 
However,  if  the  clouds  are  equally  likely  to  come  from  any  direction, 
then  it  would  appear  that  the  instrument  configuration  should  not  have 
preferred  directions,  if  that  is  possible. 

In general, the optimum instrument configuration is one that minimizes the expected squared estimation error, where the expectation is with respect to all of the random variables. To obtain an analytical solution to this problem would require knowledge of the joint probability that a given subset of the n instruments is detecting a cloud. This joint probability is a complicated function of the relative positions of the instruments, and an analysis of the problem is very difficult. Lacking an analytical solution, we have adopted a heuristic procedure that seems to provide an acceptable solution. This procedure is based on the observation that if a cloud is present above point $(x_1, y_1)$, and cloud statistics are isotropic, then the probability that a cloud is also present above point $(x_2, y_2)$ is the same for all points $(x_2, y_2)$ on a circle centered at $(x_1, y_1)$. This suggests that the instruments should be located to maximize the minimum distance from any instrument to either another instrument or the boundary of the area A. Some solutions for a square area are shown in Figure 4; other solutions can be obtained from these by rotation or reflection.
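The quantity that this heuristic maximizes can be written down directly. The sketch below only evaluates the criterion for a candidate layout in a square of side `side`; it does not search for the optimum, and the function name and coordinate convention (one corner of the square at the origin) are assumptions made for illustration.

```python
import math
from itertools import combinations


def min_clearance(points, side):
    """Smallest distance from any instrument to another instrument
    or to the boundary of the square [0, side] x [0, side]."""
    to_boundary = min(min(x, side - x, y, side - y) for x, y in points)
    if len(points) == 1:
        return to_boundary
    to_neighbor = min(math.dist(p, q) for p, q in combinations(points, 2))
    return min(to_boundary, to_neighbor)


# A single instrument does best at the center of the square.
print(min_clearance([(0.5, 0.5)], 1.0))
# Two clustered instruments score worse than two well-separated ones.
print(min_clearance([(0.5, 0.5), (0.5, 0.6)], 1.0))
print(min_clearance([(0.25, 0.25), (0.75, 0.75)], 1.0))
```

A layout with a larger value of this clearance is preferred under the heuristic; any optimizer could be wrapped around the function to look for good configurations.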

2.  Without Time Averaging

Let $(x_i, y_i)$ give the location of the $i$th instrument, $i = 1, \ldots, n$. Then the response of the $i$th instrument yields $c(x_i, y_i)$. If all instruments are read at the same instant, then the average of their responses provides an instantaneous estimate for $\bar{c}$:

$$\hat{c}(n) = \frac{1}{n} \sum_{i=1}^{n} c(x_i, y_i) . \qquad (11)$$

Figure 5 illustrates the behavior of $\bar{c}$ and $\hat{c}(n)$ for a typical case. The solid curve in this figure shows the time variation of the cloud amount, $\bar{c}$, for a 5 km-by-5 km airport area. The plan-view parameters for the cloud layer were $c_a = 0.1$, $\ell_m = 0.5$ km, $v = 5$ m/sec, $p = 1$, and $\psi = 0$. Note that the average of $\bar{c}$ over the 6-hour period was close to the large-area cloud amount $c_a$, but that significant fluctuations in $\bar{c}$ occurred as clouds moved in and out of the area. The dotted line shows the behavior of $\hat{c}(n)$ when n = 4 instruments sampled the sky. When no time averaging is used, $\hat{c}(n)$ jumps back and forth between its n + 1 possible values: 0, 1/n, 2/n, ..., 1. This erratic behavior is typical for this instantaneous estimate of the cloud amount.
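A minimal sketch of the instantaneous estimate may make the quantization explicit. It assumes each instrument reports 1 when it detects a cloud overhead and 0 otherwise; the function names are ours.

```python
def instantaneous_estimate(readings):
    """Eq. (11): average of the 0/1 readings of n instruments read at one instant."""
    return sum(readings) / len(readings)


def possible_values(n):
    """The n + 1 values the unaveraged estimate can take with n instruments."""
    return [k / n for k in range(n + 1)]


print(instantaneous_estimate([1, 0, 0, 1]))
print(possible_values(4))
```

With four instruments the estimate can only move in steps of one quarter, which is the source of the erratic behavior described above.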

Appendix D contains a derivation of the expected squared error for this estimate for a special case in which $c_a = 0.5$ and in which the n instruments are uniformly arrayed along a line segment of length w. If we assume that the area A is a square having sides of length $w = \sqrt{A}$, then the results of this analysis are as follows:

$$\hat{\sigma}^2(n) = E[(\hat{c}(n) - \bar{c})^2] = \text{[right-hand side not legible in this copy]} \qquad (12)$$
FIGURE 4  SPATIAL SAMPLING CONFIGURATIONS

FIGURE 5  INSTANTANEOUS CLOUD AMOUNT ESTIMATED BY FOUR INSTRUMENTS


where

$$u = \sqrt{A}/\ell_m .$$

Thus, for this case the expected squared error depends only on the number n of instruments and the normalized area $\sqrt{A}/\ell_m$. For applications, it is interesting to ask how many instruments are needed to obtain a given expected squared error. When the equation was solved numerically for n as a function of $\hat{\sigma}$ and $\sqrt{A}/\ell_m$, the results shown in Figure 6 were obtained. In this graph n is treated as a continuous variable, although, of course, only integral values make sense. The actual number of instruments needed to obtain a given expected squared error is the first integer greater than or equal to n.

Note that n is an increasing function of $\sqrt{A}/\ell_m$. If $\sqrt{A}/\ell_m = 0$, error-free performance can be obtained with only one instrument. As $\sqrt{A}/\ell_m$ approaches infinity, the number of instruments needed approaches $1/(4\hat{\sigma}^2)$; this corresponds to the formula

$$E[(\hat{c}(n) - c_a)^2] = \frac{c_a (1 - c_a)}{n} , \qquad (15)$$

which is valid if the instrument readings are statistically independent.
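The large-area limit can be checked numerically. The sketch below evaluates Eq. (15) and inverts it for the number of instruments; the function names are ours, and the independence of the readings is assumed, as in the text.

```python
import math


def independent_squared_error(c_a, n):
    """Eq. (15): expected squared error for n statistically independent readings."""
    return c_a * (1.0 - c_a) / n


def instruments_needed(c_a, rms_error):
    """Smallest integer n for which Eq. (15) gives at most rms_error**2."""
    return math.ceil(c_a * (1.0 - c_a) / rms_error ** 2)


print(independent_squared_error(0.5, 4))
print(instruments_needed(0.5, 0.25))
```

For $c_a = 0.5$ the second function reduces to the first integer at or above $1/(4\hat{\sigma}^2)$, in agreement with the limit stated above.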

Because Eq. (12) was derived for a special one-dimensional case, a series of twenty-four experiments were performed to see whether or not it remained valid for our more general cloud model. In all of these experiments the airport area was square, and the large-area cloud amount, $c_a$, was five tenths. The size of the area and the mean length of the clouds were varied to investigate the range $0.1 \le \sqrt{A}/\ell_m \le 25$, and the number of instruments used ranged from 1 to 4. When the area was aligned so that two sides were parallel to the direction of cloud motion, the differences between the sample mean squared errors and the value of $\hat{\sigma}^2(n)$ given by Eq. (12) were not statistically significant. When the instrument configuration was rotated 45° within the airport area, some variation in the results was observed, the greatest change being a [illegible] percent increase in $\hat{\sigma}(n)$ over the theoretical value. This suggests that $\hat{\sigma}(n)$ depends on the shape and orientation of the clouds as well as their mean length. However, the general validity of the results was verified and we believe that Eq. (12) provides a good approximate solution for the general case.
3.  With Time Averaging

As in the single-instrument case, the expected squared error can be reduced by forming a time average of the estimate $\hat{c}(n)$. If we let $c_k(x_i, y_i)$ denote the reading obtained from the instrument at $(x_i, y_i)$ at the $k$th sampling time, then the estimate of $\bar{c}$ obtained by averaging m readings is given by

$$\hat{c}_m(n) = \frac{1}{m} \sum_{k=1}^{m} \frac{1}{n} \sum_{i=1}^{n} c_k(x_i, y_i) = \frac{1}{n} \sum_{i=1}^{n} \left[ \frac{1}{m} \sum_{k=1}^{m} c_k(x_i, y_i) \right] . \qquad (16)$$


The physical interpretation of this estimate of $\bar{c}$ is similar to that given for a single instrument. If the clouds are moving, then each instrument is sampling the clouds along a line in plan view that goes through the instrument location and along the direction of cloud motion. If the cloud speed is v, and if the averaging extends over a time T, then the term

$$\frac{1}{m} \sum_{k=1}^{m} c_k(x_i, y_i)$$

is essentially the average of c(x,y) along a line of length vT. The n-instrument estimate $\hat{c}_m(n)$ is merely the average of these estimates.
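The two orderings of the double sum in Eq. (16) can be sketched as follows. The layout `readings[k][i]` (sampling time k, instrument i) and the function names are assumptions made for illustration.

```python
def time_averaged_estimate(readings):
    """Eq. (16): average of 0/1 readings over m sampling times and n instruments.

    readings[k][i] is the reading of instrument i at sampling time k.
    """
    m = len(readings)
    n = len(readings[0])
    return sum(sum(row) for row in readings) / (m * n)


def per_instrument_averages(readings):
    """Inner term of Eq. (16): time average for each instrument separately."""
    m = len(readings)
    return [sum(column) / m for column in zip(*readings)]


readings = [[1, 0],
            [1, 1]]
line_averages = per_instrument_averages(readings)
print(line_averages)
print(sum(line_averages) / len(line_averages))
print(time_averaged_estimate(readings))
```

Each per-instrument average approximates the cloud amount along one line of length vT, and averaging those values gives the same result as the direct double sum.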

In considering the expected squared error for $\hat{c}_m(n)$, one must remember that both this estimate and the quantity being estimated, $\bar{c}$, vary with time. Since the computation of $\hat{c}_m(n)$ involves readings during the past T seconds, $\hat{c}_m(n)$ at time t tends to be a better estimate of $\bar{c}$ at time $t - T/2$ than of $\bar{c}$ at time t. Here we see an important distinction between needs for aviation applications and needs for general meteorological purposes, since the former is concerned with the present cloud amount, while the latter can usually tolerate the delay of T/2 seconds. In our experiments, the optimum averaging time for aviation applications was found to be approximately one half the optimum averaging time for general meteorological applications. Thus, by computing two running averages over consecutive periods of duration T/2 seconds, both estimates can be provided with no more computation than that needed for general meteorological purposes alone.
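One way to read the two-running-averages suggestion is sketched below. The names are hypothetical, and the sketch assumes a list of instantaneous estimates, oldest first, containing at least two half-windows of samples.

```python
def half_window_estimates(estimates, half):
    """Return (aviation_estimate, meteorological_estimate).

    estimates: instantaneous estimates, oldest first
    half:      number of samples in a period of T/2 seconds
    The aviation estimate averages the most recent T/2 seconds; the
    meteorological estimate averages the full T seconds and is obtained
    from the two half-window averages without extra summation.
    """
    if len(estimates) < 2 * half:
        raise ValueError("need at least two half-windows of samples")
    recent = sum(estimates[-half:]) / half
    earlier = sum(estimates[-2 * half:-half]) / half
    return recent, (recent + earlier) / 2.0


print(half_window_estimates([0.0, 0.0, 1.0, 1.0], 2))
```

The full-window average is simply the mean of the two half-window averages, which is why both products cost no more than the longer one alone.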

In the remainder of this section we define the error to be the difference between the value of $\hat{c}_m(n)$ at time t and the value of $\bar{c}$ at time $t - T/2$. Let $\hat{\sigma}_m^2(n)$ denote the expected squared error. In general, $\hat{\sigma}_m^2(n)$ is a function of all of the variables: A, n, $\Delta T$, $\ell_m$, p, $c_a$, v, $\psi_c$, and $t_e - t_b$. As usual, we neglect the effect of $t_e - t_b$, and to simplify the problem we take $p = 1$, $c_a = 0.5$, and $\psi = 0$. In the special case $A = \infty$, we can perform an analysis like that described in Section III-C. In that case, the n instruments can be located infinitely far apart, and it turns out that

$$\lim_{A \to \infty} \hat{\sigma}_m^2(n) = \frac{\hat{\sigma}_m^2}{n} . \qquad (17)$$

This kind of analysis is useful because it shows that the same criterion for choosing the sampling interval $\Delta T$ is valid in the n-instrument case, and thus $\Delta T$ can be eliminated from the list of variables. However, the dependence of $\hat{\sigma}_m^2(n)$ on A is of considerable interest for airport operations. Since analysis of this case is very difficult, we used the computer simulation program to investigate the behavior of $\hat{\sigma}_m^2(n)$ experimentally. In these experiments we kept $v\Delta T/\ell_m$ below 0.25, so that no appreciable improvement could be obtained by reducing $\Delta T$.

For one case with $\sqrt{A}/\ell_m = 10$, we allowed $v\Delta T/\ell_m$ to reach 0.5. This is still a fairly small value, and should not have had significant influence on the results.
It was found convenient to normalize the remaining variables A, n, T, $\ell_m$ and v so that $\hat{\sigma}_m^2(n)$ was expressed in terms of n, $\sqrt{A}/\ell_m$, and $vT/\sqrt{A}$.

The experiments were restricted to the cases n = 1 and n = 4, $\sqrt{A}/\ell_m$ = 1, 2, 5 and 10, and $0 \le vT/\sqrt{A} \le 3.0$. For each of these cases, the simulation program was used to generate clouds having these statistics, and $\hat{\sigma}_m^2(n)$ was computed by computing the average of the squared error at 1080 different times.

The results of these experiments are shown in Figure 7. When $vT/\sqrt{A}$ is zero, $\hat{\sigma}_m^2(n)$ gives the expected squared error of the instantaneous (unaveraged) estimate. Thus, the ordinate intercepts of these curves give values corresponding to Figure 5. As $vT/\sqrt{A}$ is increased, $\hat{\sigma}_m^2(n)$ decreases to a minimum value, and then begins to increase again. Roughly speaking, a small amount of time averaging smooths the estimate and is beneficial, but with too much averaging, $\hat{c}_m(n)$ is unable to follow the fluctuations of $\bar{c}$, and the error increases. If n is large and the instruments are arranged in a uniform array spaced a distance $\sqrt{A/n}$ apart, it can be argued that the optimum averaging time should be given by

[Eq. (18): not legible in this copy]   (18)

since this is the longest averaging time that maintains the sampling within the area A. Inspection of Figure 7 shows that the optimum averaging time is [illegible] usually larger than [illegible], probably because some [illegible] helps to make up for the lack of information [illegible] within A. In any event, it is clear that the choice of [illegible] fortunate, since one [illegible] the averaging to changes in the cloud speed, and may have to accept a fixed averaging time. In addition, it should be kept in mind that these results are for a statistically uniform case, and the averaging time should never be allowed to become so long that the estimate is unable to respond to the onset of a new weather condition.


In general, $\hat{\sigma}_m(n)$ depends on both $vT/\sqrt{A}$ and $\sqrt{A}/\ell_m$. If we choose T to minimize $\hat{\sigma}_m(n)$ and let

$$\hat{\sigma}_{\min}(n) = \min_{vT/\sqrt{A}} \hat{\sigma}_m(n) , \qquad (19)$$

then $\hat{\sigma}_{\min}(n)$ can be significantly less than $\hat{\sigma}(n)$, the root-mean-square error without averaging. The amount of improvement depends on $\sqrt{A}/\ell_m$. As $\sqrt{A}/\ell_m$ approaches zero, the optimum averaging time approaches zero, and no improvement can be obtained. On the other hand, as $\sqrt{A}/\ell_m$ approaches infinity, $\hat{\sigma}^2(n)$ approaches $c_a(1 - c_a)/n$, whereas $\hat{\sigma}_{\min}(n)$ approaches zero. Since $\hat{\sigma}_{\min}(n)$ also approaches zero as $\sqrt{A}/\ell_m$ approaches zero, it follows that $\hat{\sigma}_{\min}(n)$ must reach a peak at some intermediate value of $\sqrt{A}/\ell_m$. The experimental results shown in Figure 7 confirm this prediction, with a maximum error near $\sqrt{A}/\ell_m = 4.0$ for n = 1 and near $\sqrt{A}/\ell_m = 10.0$ for n = 4.

These results make it clear that the instrument requirements suggested by Figure 6 are pessimistic. By averaging, one can obtain the same performance with fewer instruments. In particular, Figure 8 shows that a root-mean-square error of 0.17 or less can always be obtained with one instrument, and 0.085 or less can always be obtained with four instruments. Assuming that $\hat{\sigma}_{\min}^2(n)$ varies inversely with n, we obtain the instrument/performance curve shown in Figure 9. This represents the best performance that can be obtained with a set of n ideal vertically-pointing instruments.
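The inverse-with-n assumption can be turned into a small calculation. The sketch below uses only the one-instrument value of 0.17 quoted from Figure 8; the function names are ours, and the results are only as good as the assumption that the squared error varies inversely with n.

```python
import math

RMS_ONE_INSTRUMENT = 0.17  # worst-case rms error with one instrument, as quoted from Figure 8


def rms_bound(n):
    """Worst-case rms error with n instruments, assuming squared error ~ 1/n."""
    return RMS_ONE_INSTRUMENT / math.sqrt(n)


def instruments_for(target_rms):
    """Smallest n whose worst-case rms error does not exceed target_rms."""
    return math.ceil((RMS_ONE_INSTRUMENT / target_rms) ** 2)


print(rms_bound(4))
print(instruments_for(0.1))
```

The four-instrument value reproduces the 0.085 figure quoted above, which is consistent with the inverse-with-n assumption.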

E.  Scanning Instruments

An ideal scanning instrument measures the range r from the instrument to the intersection of its line of sight with a cloud. By changing the azimuth angle $\theta$ and the elevation angle $\alpha$, many ceiling height measurements can be obtained from a single instrument (see Figure 10). The basic sampling problem is to determine the number of values of $\theta$ and $\alpha$ that are necessary to obtain a given performance.

In the case of a single cloud layer, one scanning instrument is equivalent to a number of vertically-pointing instruments. If the base height is h, and if the $i$th range reading is taken at elevation $\alpha_i$ and azimuth $\theta_i$, then the equivalent vertically-pointing instruments are
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FIGURE  8  REDUCTION  OF  ROOT  MEAN  SQUARE  ERROR  BY  OPTIMUM  AVERAGING 


located at

$$x_i = h \, \frac{\cos \theta_i}{\tan \alpha_i} \qquad (20)$$

and

$$y_i = h \, \frac{\sin \theta_i}{\tan \alpha_i} . \qquad (21)$$


Thus, the equivalent instrument locations vary with the height of the cloud layer. If a fixed set of angles $\theta_i$ and $\alpha_i$ are used, then the equivalent instruments move away from the origin as the cloud layer rises. At high enough altitudes, the only equivalent instrument that will remain within the airport area will be the one corresponding to $\alpha = 90°$. If only those readings corresponding to equivalent instruments within the airport area are used to estimate cloud amount, then the effective number of equivalent instruments may become small for high clouds.
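The dependence of the equivalent locations on cloud height can be sketched as follows. The function name is hypothetical, angles are taken in degrees, and the elevation is assumed to lie strictly above zero (at zero elevation the line of sight never reaches the layer).

```python
import math


def equivalent_location(base_height, elevation_deg, azimuth_deg):
    """Plan-view position (x, y) of the equivalent vertically-pointing
    instrument for a reading at the given elevation and azimuth,
    following Eqs. (20) and (21)."""
    alpha = math.radians(elevation_deg)
    theta = math.radians(azimuth_deg)
    ground_range = base_height / math.tan(alpha)
    return ground_range * math.cos(theta), ground_range * math.sin(theta)


# The same pointing angles map farther from the origin as the layer rises.
print(equivalent_location(500.0, 45.0, 0.0))
print(equivalent_location(2000.0, 45.0, 0.0))
# A vertical reading stays (numerically) at the origin for any height.
print(equivalent_location(2000.0, 90.0, 0.0))
```

Comparing the outputs with the half-width of the airport area shows which readings would have to be discarded for a high layer.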

An even more serious problem arises when the clouds are very low and the equivalent instrument locations are near the origin. In this case a very small elevation angle must be used to get samples representing conditions some distance away, and accuracy problems become severe. Since low ceiling heights are of particular importance to aviation applications, this problem may limit the effectiveness of one scanning instrument as a substitute for several vertically-pointing instruments.

Figure 11 illustrates two different ways to choose elevation angles. In Figure 11(a) the angular increments are equal, while in Figure 11(b) the distances between adjacent samples are equal. The former method leads to simpler equipment, but results in nonuniform sampling for low cloud layers. These diagrams show scanning in a single vertical plane. Scanning in three dimensions can be done by scanning in several planes at different azimuth angles. An alternative that is a three-dimensional version of equal-increment sampling is to sample at points evenly distributed over a hemisphere. The three-dimensional version of the equal-distance case is to sample at points evenly distributed over a horizontal plane.
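The two single-plane schedules can be compared with a short sketch. The function names and the choice of a lowest elevation angle or a maximum horizontal range as inputs are assumptions made for illustration; at least two samples are assumed.

```python
import math


def equal_angle_elevations(k, min_elevation_deg):
    """k elevation angles in equal increments from min_elevation_deg up to 90 degrees."""
    step = (90.0 - min_elevation_deg) / (k - 1)
    return [min_elevation_deg + i * step for i in range(k)]


def equal_distance_elevations(k, base_height, max_range):
    """k elevation angles whose samples on a layer at base_height are
    equally spaced in horizontal distance from 0 to max_range."""
    step = max_range / (k - 1)
    return [math.degrees(math.atan2(base_height, i * step)) for i in range(k)]


def ground_ranges(elevations_deg, base_height):
    """Horizontal distance of each sample on a layer at base_height."""
    return [base_height / math.tan(math.radians(a)) for a in elevations_deg]


print(ground_ranges(equal_angle_elevations(4, 15.0), 150.0))
print(equal_distance_elevations(3, 500.0, 1000.0))
```

For a low layer the equal-angle schedule bunches its samples near the instrument, while the equal-distance schedule requires angles that depend on the (unknown) base height.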
FIGURE 12  INCREMENT IN ELEVATION ANGLE REQUIRED TO OBTAIN THE SAME PERFORMANCE AS n VERTICALLY-POINTING INSTRUMENTS

These constraints place fairly severe conditions on the use of a single scanning instrument as a replacement for n vertically-pointing instruments. In addition to the angular problem, other difficulties arise when range readings at low elevation angles are attempted. These include interference by structures and personnel, atmospheric attenuation, and the effects of irregularities of the cloud base. If a single scanning instrument is to be used, it will probably be necessary to accept the fact that the effective area covered for low ceiling conditions must be reduced.


IV  CLOUD BASE-HEIGHT ESTIMATION


A.  Introduction

The estimation of cloud base heights introduces three new parameters, the mean base height $\bar{h}$, the base-height standard deviation $\sigma_b$, and the base-height correlation distance d. With our model, the height of the base of the cloud above a point (x,y) is assumed to be a random variable h(x,y) with mean $\bar{h}$ and variance $\sigma_b^2$. The correlation coefficient for values of h measured at two points a distance $\delta$ apart is given by $\exp[-\delta/d]$.

The problem of base height estimation for a single layer is to estimate $\bar{h}$ from a series of readings of h(x,y). The estimation of $\sigma_b$ is also of some interest, since that parameter measures the irregularity of the cloud base. However, the primary problem is to determine $\bar{h}$. An analysis for the case of a single cloud layer is given in Appendix E.

When several layers are present, the problem becomes more complicated. In this case, the problem is to separate the readings from the different layers. Once this is done, the mean base height for each layer can be estimated. In addition, the separation of readings according to layers allows the estimation of the cloud amount in layers.

The simplest of all situations is the case of a single layer of ten-tenths cloud cover. In this case, one can assume that the probability density function p(h) for a height reading h is normal:

$$p(h) = \frac{1}{\sqrt{2\pi}\,\sigma_b} \exp\left[ -\frac{(h - \bar{h})^2}{2\sigma_b^2} \right] . \qquad (25)$$

If we obtain n successive readings $h_1, \ldots, h_n$, then $\bar{h}$ and $\sigma_b^2$ can be estimated by computing the sample mean

$$\hat{h} = \frac{1}{n} \sum_{i=1}^{n} h_i \qquad (26)$$

and the sample variance

$$s^2 = \frac{1}{n-1} \sum_{i=1}^{n} (h_i - \hat{h})^2 , \qquad (27)$$

respectively. If the readings are independently distributed according to Eq. (25), then it is well known that $\hat{h}$ converges to $\bar{h}$ and $s^2$ converges to $\sigma_b^2$ with probability one as n approaches infinity.
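For the single-layer, ten-tenths case the two estimators are the ordinary sample mean and the unbiased sample variance. A minimal sketch follows; the function names and the three readings are ours, chosen only to illustrate the arithmetic.

```python
def sample_mean(heights):
    """Eq. (26): equal-weight average of the height readings."""
    return sum(heights) / len(heights)


def sample_variance(heights):
    """Eq. (27): sample variance with the n - 1 divisor (needs at least two readings)."""
    mean = sample_mean(heights)
    return sum((h - mean) ** 2 for h in heights) / (len(heights) - 1)


readings_m = [480.0, 500.0, 520.0]  # illustrative readings in meters
print(sample_mean(readings_m))
print(sample_variance(readings_m))
```

The square root of the second value estimates the base-height standard deviation, here 20 m for the illustrative readings.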

Unfortunately, this approach is not valid in all situations of interest. At least four complicating factors can arise:

(1)  There may be holes in the cloud layer.

(2)  There may be correlation between successive readings.

(3)  There may be time variations in $\bar{h}$ and $\sigma_b^2$.

(4)  There may be more than one cloud layer.

The presence of several of these factors in combination may be much more serious than their presence singly. For example, the mere presence of holes in an otherwise uniform, single layer causes no problems beyond the occasional absence of readings. Similarly, correlation between successive readings by itself presents no fundamental problem, but merely slows the convergence of $\hat{h}$ to $\bar{h}$ and $s^2$ to $\sigma_b^2$. The presence of both of these factors together can be more bothersome. Suppose, for example, that a hole interrupts a series of correlated readings $h_1, \ldots, h_{n-1}$, and that an $n$th reading $h_n$ is finally obtained. If enough time has elapsed so that $h_n$ is essentially an independent reading, it is clear that it should receive more weight than the earlier readings, and not the equal weight provided by Eq. (26). A derivation of the optimum weighting is complicated even when simple Markov dependence can be assumed, and requires knowledge of the correlation coefficient for successive readings.

Probably the most serious combination of conditions in practice is the simultaneous presence of holes and multiple layers. While this situation can be complicated further by adding correlation and time variation, the basic problem of this combination is that one can no longer assume that h has a normal distribution. Instead, p(h) becomes a mixture of normal densities, one for each layer. To state this formally, let

L = number of cloud layers,

$c_k$ = cloud amount for the $k$th layer,

$p_k(h)$ = probability density function for height readings from the $k$th layer,

$\bar{h}_k$ = mean cloud height for the $k$th layer,

and

$\sigma_{bk}^2$ = cloud-height variance for the $k$th layer.

We assume that $c_k$ can be interpreted as the probability that the $k$th layer will contain a cloud directly over the instrument, and that events in different layers occur independently. Under this assumption, the probability that the instrument sees through holes in the first k-1 layers and detects a cloud in the $k$th layer is given by*

$$P_k = \begin{cases} c_1 , & k = 1 \\ c_k \prod_{i=1}^{k-1} (1 - c_i) , & k = 2, \ldots, L . \end{cases} \qquad (28)$$

Then, ignoring the cases in which no cloud is detected, the law of total probability yields

$$p(h) = \sum_{k=1}^{L} p_k(h) \, P_k . \qquad (29)$$
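Eqs. (28) and (29) can be sketched numerically as follows. Layers are listed from the lowest upward; the function names are ours, and normal component densities are assumed purely for the illustration (Eq. (29) itself does not require them).

```python
import math


def detection_probabilities(cloud_amounts):
    """Eq. (28): probability that the lowest cloud seen lies in each layer,
    given per-layer cloud amounts c_k listed from the lowest layer upward."""
    probabilities = []
    clear_so_far = 1.0  # probability that all lower layers have a hole overhead
    for c in cloud_amounts:
        probabilities.append(c * clear_so_far)
        clear_so_far *= 1.0 - c
    return probabilities


def normal_pdf(h, mean, std):
    z = (h - mean) / std
    return math.exp(-0.5 * z * z) / (math.sqrt(2.0 * math.pi) * std)


def mixture_density(h, probabilities, means, stds):
    """Eq. (29) with normal components (an assumption of this sketch)."""
    return sum(P * normal_pdf(h, m, s)
               for P, m, s in zip(probabilities, means, stds))


P = detection_probabilities([1.0 / 3.0, 0.5, 1.0])
print(P)
print(mixture_density(500.0, P, [500.0, 600.0, 640.0], [20.0, 20.0, 20.0]))
```

With cloud amounts of one third, one half and one for three layers, each layer is the one detected with probability one third.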


This result, which shows that p(h) is a mixture of the probability density functions for each layer, is valid even if the component densities $p_k(h)$ are not normal. If no parametric assumptions can be made about the component densities, then all one can do is to estimate p(h) and perhaps associate its modes (local maxima) with layers. If the component densities can be assumed to be normal, then the problem is to process the measurements $h_1, \ldots, h_n$ to determine the number of layers L, and the cloud amount $P_k$, mean base height $\bar{h}_k$, and base height variance $\sigma_{bk}^2$ for each layer. We shall consider first the case in which no parametric assumptions are possible.

* Note that Eq. (28) can be inverted to yield

$$c_k = \frac{P_k}{1 - \sum_{i=1}^{k-1} P_i} .$$

However, it is the probability $P_k$, and not the cloud amount $c_k$, that constitutes the conventional report of cloud amount in the $k$th layer.

B.  Nonparametric Techniques for Density Estimation

Several nonparametric techniques are available for estimating probability density functions, including the Parzen window method (Parzen, 1962), the $k_n$-nearest-neighbor method (Loftsgaarden and Quesenberry, 1965), and the orthogonal-series method (Tarter and Kronmal, 1968). The Parzen window method is a generalization of the histogram idea, and is the simplest computationally. The $k_n$-nearest-neighbor method has some theoretically appealing properties, as does the orthogonal-series method. However, because of their practical limitations, we shall discuss the former only briefly, and the latter not at all.

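1.  The Parzen Window Method

Let $h_1, \ldots, h_n$ be a set of n samples independently and identically distributed according to the unknown density p(h), and let f(u) be any function satisfying

$$f(u) \ge 0 , \qquad (30)$$

$$\int_{-\infty}^{\infty} f(u) \, du = 1 , \qquad (31)$$

and

$$\int_{-\infty}^{\infty} f^2(u) \, du < \infty . \qquad (32)$$

Then the Parzen window estimate for p(h) is given by

$$p_n(h) = \frac{1}{n} \sum_{i=1}^{n} \frac{1}{w_n} f\left( \frac{h - h_i}{w_n} \right) , \qquad (33)$$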


where $w_n$ is a parameter known as the window width. If $w_n$ is small compared to the typical distance between adjacent samples, $p_n(h)$ is irregular, with a local peak at each sample. If $w_n$ is large, $p_n(h)$ is a smooth, "out-of-focus" estimate of p(h). If the number of samples is fixed, one must settle for a value of $w_n$ that gives a best compromise between an estimate that is too erratic and one that lacks resolution. If the number of samples is not fixed, one can let $w_n$ become smaller as n gets larger, thereby gaining resolution without sacrificing stability. In theory, it can be shown that if $w_n$ approaches zero as n approaches infinity in such a way that $n w_n$ approaches infinity, then $p_n(h)$ converges to p(h) in the mean-square sense at all points where p(h) is continuous (Parzen, 1962). An example of a window width that satisfies these conditions is $w_n = w_1/\sqrt{n}$.
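The Parzen window estimate of Eq. (33) is short enough to sketch directly. The rectangular and triangular windows below are the ones used in the experiments reported in this section; the function names and the tiny sample sets are ours.

```python
def rectangular(u):
    """f(u) = 0.5 for -1 <= u <= 1, zero elsewhere."""
    return 0.5 if -1.0 <= u <= 1.0 else 0.0


def triangular(u):
    """f(u) = 1 - |u| for -1 <= u <= 1, zero elsewhere."""
    return max(0.0, 1.0 - abs(u))


def parzen_estimate(h, samples, width, window=rectangular):
    """Eq. (33): Parzen window estimate of the density at height h."""
    n = len(samples)
    return sum(window((h - h_i) / width) for h_i in samples) / (n * width)


samples_m = [495.0, 505.0, 600.0, 640.0]  # illustrative readings in meters
for height in (500.0, 550.0, 620.0):
    print(height, parzen_estimate(height, samples_m, 25.0, triangular))
```

Shrinking `width` makes the estimate spikier around individual samples, while enlarging it blurs neighboring layers together, which is the compromise described above.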

To see how this method might perform in practice, we used the model to generate ceiling height data for a three-layer problem having the following parameter values:

Layer    $c_i$    $P_i$    $\bar{h}_i$ (m)    $\sigma_{bi}$ (m)
  1      .333     .333     500                20
  2      .500     .333     600                20
  3      1.000    .333     640                20

A sample of 180 readings taken at one-minute intervals is shown in Figure 13. The lower layer at 500 meters is clear to the eye, but it is harder to discriminate between the upper two layers. When these data were used with a rectangular window function, f(u) = 0.5 for $-1 \le u \le 1$ and zero elsewhere, and with $w_n = 100/\sqrt{n}$, the results shown in Figure 14 were obtained. (For clarity, the curves for successive values of n are displaced along the ordinate.) Note that the lower mode becomes visible after 30 samples, but even after 180 samples the structure of the data is not clear.

The use of a triangular window function, f(u) = 1 - |u| for $-1 \le u \le 1$ and zero elsewhere, gave curves having a smoother appearance (see Figure 15), but again the trimodal structure was not clearly revealed. Considerable improvement was obtained by fixing the window width at $w_n = 25$ meters (Figure 16) and by adding readings from two other simulated instruments (Figure 17). Under these conditions, the lower layer at 500 meters could be separated reliably from the upper two layers, but it was impossible to separate the upper layers by inspection. This last result could have been anticipated by the fact that the sum of two normal densities whose means differ by two or less than two standard deviations has only one local maximum.

FIGURE 14  PARZEN WINDOW ESTIMATES USING A RECTANGULAR WINDOW
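The unimodality remark can be checked numerically for equal-weight, equal-variance components. The sketch below counts strict local maxima of the summed density on a one-meter grid; the function names, grid limits and step are our own choices.

```python
import math


def normal_pdf(h, mean, std):
    z = (h - mean) / std
    return math.exp(-0.5 * z * z) / (math.sqrt(2.0 * math.pi) * std)


def count_modes(mean_1, mean_2, std, low, high, step=1.0):
    """Count strict local maxima of an equal-weight sum of two normal
    densities with common standard deviation, evaluated on a grid."""
    n_points = int(round((high - low) / step)) + 1
    values = [normal_pdf(low + i * step, mean_1, std)
              + normal_pdf(low + i * step, mean_2, std)
              for i in range(n_points)]
    return sum(1 for i in range(1, n_points - 1)
               if values[i] > values[i - 1] and values[i] > values[i + 1])


# Upper two layers of the example: means 40 m apart, standard deviation 20 m.
print(count_modes(600.0, 640.0, 20.0, 400.0, 800.0))
# Lower and middle layers: means 100 m apart.
print(count_modes(500.0, 600.0, 20.0, 400.0, 800.0))
```

The upper two layers of the example are exactly two standard deviations apart, so no choice of window can be expected to reveal them as separate peaks.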

2. The $k_n$-Nearest-Neighbor Method

The Parzen window method has been criticized on the grounds
that the results obtained for finite n depend so strongly on the choice
of the window function f(u) and the window width $w_n$. The
$k_n$-nearest-neighbor method avoids this arbitrariness. To estimate p(h)
at a particular value of h, one finds the $k_n$ samples nearest to h,
where $k_n$ is a specified function of n, usually $\sqrt{n}$. Of these
nearest neighbors, let h' be the sample farthest from h. Then the
$k_n$-nearest-neighbor estimate of p(h) is

$$p_n(h) = \frac{k_n / n}{2\,|h - h'|} \tag{34}$$

It can be shown that if $k_n$ approaches infinity as n
approaches infinity in such a way that $k_n/n$ approaches zero, then
$p_n(h)$ converges to p(h) in probability at all points where p(h) is
continuous (Loftsgaarden and Quesenberry, 1965). Thus, the
$k_n$-nearest-neighbor method is a conceptually simple, convergent,
general nonparametric method for estimating density functions.
Unfortunately, when the $k_n$-nearest-neighbor method was programmed, the
computation of $p_n(h)$ turned out to be quite time-consuming, and the
results obtained had no evident advantage for ceiling-height estimation.
Figure 18 shows the $k_n$-nearest-neighbor estimate of p(h) based on the
data of our previous example. While modes can be found by finding height
intervals for which p(h) exceeds some threshold, nearly the same results
can be obtained with the Parzen-window method with considerably less
computation.
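
As an illustration of Eq. (34), the sketch below evaluates the
nearest-neighbor estimate at a single height. The default choice of k as
the rounded square root of n follows the text; the function name and the
guard against a zero radius are assumptions added for the example.

```python
import math


def knn_density(h, samples, k=None):
    """k-nearest-neighbor density estimate of Eq. (34) at height h."""
    n = len(samples)
    if k is None:
        k = max(1, round(math.sqrt(n)))  # k_n = sqrt(n), as suggested in the text
    distances = sorted(abs(h - x) for x in samples)
    radius = distances[k - 1]  # |h - h'|, distance to the k-th nearest sample
    if radius == 0.0:
        return float("inf")  # assumption: k or more samples coincide with h
    return (k / n) / (2.0 * radius)


estimate = knn_density(2.5, [1.0, 2.0, 3.0, 4.0], k=2)
```

Each evaluation sorts all n distances, which gives some feeling for why
the method proved time-consuming when p(h) had to be evaluated at many
heights.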
FIGURE 17  EFFECT OF USING THREE INSTRUMENTS


C.  Analysis of Normal Mixtures

In recent years, the problem of analyzing a mixture density

$$p(h) = \sum_{k=1}^{L} p_k(h)\,P_k \tag{29}$$

to find the component densities $p_k(h)$ and the mixing probabilities $P_k$
has received considerable attention. Some authors have investigated
the problem of extracting the component densities when the mixture
density is known exactly (Medgyessy, 1961). Others have analyzed the
more difficult problem where only samples drawn from p(h) are available
(Stanat, 1968; Wolfe, 1970). Wolfe derives general equations for
maximum likelihood estimates of the unknown parameters, assuming that the
number L of components is known. In the special case where $p_k(h)$ is a
normal density with mean $\bar{h}_k$ and variance $\sigma_k^2$, his
equations yield the following conditions on the estimates:

$$\hat{P}_k = \frac{1}{n} \sum_{i=1}^{n} \hat{P}(k \mid h_i) \tag{35}$$

$$\hat{h}_k = \frac{1}{n \hat{P}_k} \sum_{i=1}^{n} h_i\, \hat{P}(k \mid h_i) \tag{36}$$

$$\hat{\sigma}_k^2 = \frac{1}{n \hat{P}_k} \sum_{i=1}^{n} (h_i - \hat{h}_k)^2\, \hat{P}(k \mid h_i) \tag{37}$$

where $\hat{P}_k$ is the estimate of $P_k$, $\hat{h}_k$ is the estimate of
$\bar{h}_k$, $\hat{\sigma}_k^2$ is the estimate of $\sigma_k^2$, and
$\hat{P}(k \mid h)$ is given by

$$\hat{P}(k \mid h) = \frac{\hat{p}_k(h)\, \hat{P}_k}{\sum_{j=1}^{L} \hat{p}_j(h)\, \hat{P}_j} \tag{38}$$

These equations have a simple physical interpretation.
$\hat{P}(k \mid h)$ is the probability that a height reading h came from
sensing the kth cloud layer. Roughly speaking, the estimate of $P_k$ given
by Eq. (35) is the fraction of readings coming from the kth layer; since
we never know for sure which layer gave rise to reading $h_i$, the
fraction is computed using the probabilities $\hat{P}(k \mid h_i)$. The
quantity $n\hat{P}_k$ is essentially our estimate of the number of
readings that came from the kth layer. Thus Eqs. (36) and (37) can be
interpreted as Eqs. (26) and (27) modified to account for our uncertainty
as to the layer for reading $h_i$.

Unfortunately, the solution of these equations is complicated
greatly by the fact that the unknown estimates are involved in the
computation of $\hat{P}(k \mid h)$. Wolfe suggests some iterative
procedures, but it is clear that their performance depends crucially on
obtaining good initial values for the unknown quantities. If the means
$\bar{h}_k$ are widely separated relative to the standard deviations
$\sigma_k$, then various clustering procedures can be used to obtain the
initial estimates. Although these procedures are heuristic and lack
provable optimality properties, they may provide a complete practical
solution to the problem.
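
One simple way to attack Eqs. (35) to (38) is to substitute the current
estimates into Eq. (38) and then recompute the estimates from Eqs. (35) to
(37), repeating until they settle. The sketch below does this for
hypothetical data; it is not claimed to be one of Wolfe's procedures, and
the function names, the floor on the standard deviation, the starting
values, and the fixed iteration count are all assumptions.

```python
import math


def normal_pdf(h, mean, sigma):
    """Normal density with the given mean and standard deviation."""
    z = (h - mean) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def mixture_step(samples, weights, means, sigmas):
    """One substitution pass through Eqs. (35)-(38) for a normal mixture."""
    n, num_layers = len(samples), len(weights)
    # Eq. (38): probability that each reading came from each layer.
    post = []
    for h in samples:
        joint = [weights[k] * normal_pdf(h, means[k], sigmas[k])
                 for k in range(num_layers)]
        total = sum(joint)
        post.append([j / total for j in joint])
    new_w, new_m, new_s = [], [], []
    for k in range(num_layers):
        n_k = sum(p[k] for p in post)  # n * P_k, expected readings from layer k
        mean = sum(p[k] * h for p, h in zip(post, samples)) / n_k  # Eq. (36)
        var = sum(p[k] * (h - mean) ** 2
                  for p, h in zip(post, samples)) / n_k            # Eq. (37)
        new_w.append(n_k / n)                                      # Eq. (35)
        new_m.append(mean)
        new_s.append(max(math.sqrt(var), 1e-6))  # assumed floor, avoids sigma = 0
    return new_w, new_m, new_s


# Hypothetical readings from two well-separated layers (meters).
samples = [498.0, 502.0, 505.0, 495.0, 500.0, 598.0, 602.0, 605.0, 595.0, 600.0]
w, m, s = [0.5, 0.5], [520.0, 580.0], [30.0, 30.0]  # rough initial guesses
for _ in range(20):
    w, m, s = mixture_step(samples, w, m, s)
```

With layers this far apart the estimates settle quickly; when the layers
overlap, the outcome depends much more on the starting values, which is
the difficulty noted above.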

D.  Clustering 

The  literature  on  clustering  is  quite  extensive,  with  a  number 
of  special  procedures  having  been  developed  to  meet  the  needs  of  particu¬ 
lar  problems.  Excellent  surveys  of  this  work  have  been  given  by  Ball  (1965) 
and  Sneath  (1969).  Many  of  these  procedures  can  be  described  as  iterative 
methods  for  partitioning  the  data  to  minimize  a  criterion  function.  We 
shall  examine  three  such  techniques  that  use  the  pooled  sample  variance 
as  a  criterion  function. 

1. The Pooled Sample Variance

Suppose that we partition our set of n samples $\{h_1, \ldots, h_n\}$
into g subsets or clusters $C_1, \ldots, C_g$. Let $n_i$ be the number of
elements in the ith cluster, and define the sample mean $\bar{h}_i$ and
the sample scatter $s_i^2$ for the ith cluster by

$$\bar{h}_i = \frac{1}{n_i} \sum_{h \in C_i} h \tag{39}$$

and

$$s_i^2 = \sum_{h \in C_i} (h - \bar{h}_i)^2 \tag{40}$$

respectively. The pooled sample scatter $s_p^2$ and the pooled sample
variance $\hat{\sigma}_p^2$ for this partition of the data are defined as

$$s_p^2 = \sum_{i=1}^{g} s_i^2 \tag{41}$$

and

$$\hat{\sigma}_p^2 = \frac{1}{n-1}\, s_p^2 \tag{42}$$

respectively. The pooled sample variance is a measure of the variability
associated with the partitioned data. A general goal is to find that
partitioning that minimizes this criterion function.

It should be noted that the minimum value of $\hat{\sigma}_p^2$ depends
on the number g of clusters. If g = n, i.e., if each measurement is a
cluster, then $\hat{\sigma}_p^2 = 0$. If g = 1, then $\hat{\sigma}_p^2$ is
merely the sample variance for all of the data. In general, it can be
shown that the minimum value of $\hat{\sigma}_p^2$ is a monotonically
nonincreasing function of the number of clusters.
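
Eqs. (39) to (42) translate directly into a few lines of code. The sketch
below is illustrative only; the function names and the small data set are
assumptions.

```python
def scatter(cluster):
    """Sample scatter of Eq. (40): squared deviations about the cluster mean."""
    mean = sum(cluster) / len(cluster)  # Eq. (39)
    return sum((h - mean) ** 2 for h in cluster)


def pooled_variance(clusters):
    """Pooled sample variance of Eq. (42) for a partition of the data."""
    n = sum(len(c) for c in clusters)
    pooled_scatter = sum(scatter(c) for c in clusters)  # Eq. (41)
    return pooled_scatter / (n - 1)


data = [1.0, 3.0, 10.0, 12.0, 14.0]
one_cluster = pooled_variance([data])                       # g = 1
two_clusters = pooled_variance([[1.0, 3.0], [10.0, 12.0, 14.0]])
singletons = pooled_variance([[h] for h in data])           # g = n
```

The three values show the two limiting cases mentioned above: the ordinary
sample variance for g = 1 and zero for g = n, with the two-cluster
partition falling in between.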


2. Hierarchical Clustering

No computationally feasible solution is known for the
problem of partitioning n samples into g clusters so that
$\hat{\sigma}_p^2$ is minimized. (Exhaustive procedures that consider all
possible groupings are not computationally feasible.) A number of
heuristic methods have been suggested that seem to give good results,
although the partitions obtained are not always optimal. Among the
simplest of these are the so-called hierarchical clustering procedures
(Sokal and Sneath, 1963; Johnson, 1967). These procedures form a sequence
of clusterings starting with the original data viewed as a set of n
clusters and ending with all the data in one cluster. The step from g
clusters to g - 1 clusters is made by measuring a distance between all
pairs of clusters and merging the closest pair. Once the members in two
clusters are joined, they are never separated in subsequent steps. This
greatly reduces the number of partitions that need to be considered, and
leads to the hierarchical structure in the sequence of clusterings.




In the case of ceiling height data, the following
hierarchical clustering scheme is very effective.

(1)  Initially, order the samples so that
     $h_1 \leq h_2 \leq \cdots \leq h_n$. Let g = n, $C_j = \{h_j\}$,
     $n_j = 1$, and $\bar{h}_j = h_j$, j = 1, ..., n. This corresponds
     to n singleton clusters.

(2)  Measure the g - 1 distances between pairs of adjacent
     sample means. Let $\bar{h}_j$ and $\bar{h}_k$ be the closest pair.

(3)  Merge $C_j$ and $C_k$. Mathematically, replace $C_j$ by
     $C_j \cup C_k$ and delete $C_k$.

(4)  Update the sample mean by replacing $\bar{h}_j$ by
     $(n_j \bar{h}_j + n_k \bar{h}_k)/(n_j + n_k)$.

(5)  Update the counters by replacing g by g - 1, replacing
     $n_j$ by $n_j + n_k$, and deleting $n_k$.

(6)  If g = 1, stop. Otherwise, return to Step 2.

Here we have used the distance between sample means as a
measure of the distance between clusters, and we have taken advantage of
the one-dimensional character of the data to reduce the comparisons from
the g(g - 1)/2 possible pairs of clusters to the g - 1 adjacent pairs.
The computation required is very straightforward, with this description
of the procedure amounting to a flow chart for the program.
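
The six steps above can be sketched as follows. The procedure in the text
runs until g = 1; stopping early at a requested number of clusters
(`g_target`) is an assumption added for convenience, as are the function
name and the sample readings.

```python
def hierarchical_cluster(samples, g_target=1):
    """Merge adjacent clusters of sorted 1-D data until g_target remain."""
    # Step 1: sort the samples and start with n singleton clusters.
    clusters = [[h] for h in sorted(samples)]
    means = [float(c[0]) for c in clusters]
    while len(clusters) > g_target:
        # Step 2: find the closest pair of adjacent sample means.
        j = min(range(len(clusters) - 1), key=lambda i: means[i + 1] - means[i])
        k = j + 1
        n_j, n_k = len(clusters[j]), len(clusters[k])
        # Step 4: weighted update of the merged sample mean.
        means[j] = (n_j * means[j] + n_k * means[k]) / (n_j + n_k)
        # Steps 3 and 5: merge C_k into C_j and delete C_k.
        clusters[j] = clusters[j] + clusters[k]
        del clusters[k], means[k]
    return clusters


# Hypothetical readings in meters (illustration only).
layers = hierarchical_cluster([500, 503, 498, 601, 599, 642, 640], g_target=3)
```

Recording the partition at every value of g, rather than only at
`g_target`, would give the full hierarchy of clusterings.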

Intuitively, it is clear that by combining closest clusters
we are tending to partition the data so as to minimize the pooled sample
variance. However, it is not hard to find examples for which the
resulting clusters are not optimum. There is a simple modification of
this procedure that selects clusters $C_j$ and $C_k$ so that the pooled
sample variance after merging is minimized. If we let
$\hat{\sigma}_p^2(g)$ denote the pooled sample variance before merging,
then it is not hard to show that

$$\hat{\sigma}_p^2(g-1) = \hat{\sigma}_p^2(g) + \frac{1}{n-1}\,\frac{n_j n_k}{n_j + n_k}\,(\bar{h}_j - \bar{h}_k)^2 \tag{43}$$

Thus, no matter which pair of clusters we merge,
$\hat{\sigma}_p^2(g-1) \geq \hat{\sigma}_p^2(g)$. However, we can minimize
the increase in the pooled sample variance by selecting the pair of
clusters for which the squared "distance"

$$d_{jk}^2 = \frac{n_j n_k}{n_j + n_k}\,(\bar{h}_j - \bar{h}_k)^2 \tag{44}$$

is minimum. If all of the clusters had the same number of elements, this
corresponds to selecting the pair whose sample means are closest. When
the clusters have different sizes, this criterion favors the growth of
the larger clusters.

To use this rule as stated, it is necessary to compute
$d_{jk}^2$ for all g(g-1)/2 pairs of clusters. However, it is not hard to
show that the sample means for which $d_{jk}^2$ is minimum are adjacent.
Thus, our previous procedure will yield the minimum increase in
$\hat{\sigma}_p^2$ at each step if the distances measured in Step 2 are
computed by Eq. (44).
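
A quick numerical check of Eq. (43) is to confirm that merging two
clusters raises the pooled scatter by exactly the quantity in Eq. (44).
The sketch below does this for a small hypothetical pair of clusters; the
function names and data are assumptions.

```python
def scatter(cluster):
    """Sample scatter: squared deviations about the cluster mean."""
    mean = sum(cluster) / len(cluster)
    return sum((h - mean) ** 2 for h in cluster)


def merge_cost(n_j, mean_j, n_k, mean_k):
    """Squared 'distance' of Eq. (44) between two clusters."""
    return n_j * n_k / (n_j + n_k) * (mean_j - mean_k) ** 2


a = [1.0, 3.0]            # mean 2, scatter 2
b = [10.0, 12.0, 14.0]    # mean 12, scatter 8
increase = scatter(a + b) - (scatter(a) + scatter(b))
cost = merge_cost(len(a), 2.0, len(b), 12.0)
```

Both quantities equal 120 here; dividing by n - 1 = 4 gives the increase
of 30 in the pooled sample variance predicted by Eq. (43).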

3. Hierarchical Splitting

Hierarchical clustering schemes have the disadvantage
that even if one is only interested in partitioning the data into two
clusters, it is necessary to consider n clusters, n-1 clusters, n-2
clusters, etc. Hierarchical splitting procedures attempt to achieve
similar results by starting with the data as one cluster and repeatedly
splitting clusters in two.

The first problem we face is the selection of the cluster
to split. Since our goal is to partition the data so as to minimize
the sum of the scatters $s_i^2$, it is reasonable to select the cluster
for which $s_i^2$ is maximum. A computationally more costly alternative
is to make trial splits of every cluster, selecting that cluster that
ultimately yields the greatest reduction in the sum of scatters.

The second problem we face is the decision of how to
split the cluster. With multivariate data, this problem is so severe
that hierarchical splitting methods are rarely if ever used. However,
with univariate data, such as that provided by ceiling-height
measurements, an exhaustive approach can be considered. Let us consider
only those partitions of clusters formed by picking a critical value h*,
with the samples below h* forming one cluster and the remaining samples
the other. If the cluster to be split contains m members, then it can be
partitioned into two clusters in m-1 ways. For each of these partitions
we compute the sum of the two sample scatter values given by Eq. (40).
The best partition is the one that minimizes this sum.
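
The exhaustive search over the m-1 possible splits of a univariate
cluster can be sketched as below. Placing the critical value h* midway
between the two samples that straddle the split is an assumption, as are
the function names and the data.

```python
def scatter(cluster):
    """Sample scatter of Eq. (40)."""
    mean = sum(cluster) / len(cluster)
    return sum((h - mean) ** 2 for h in cluster)


def best_split(cluster):
    """Try all m-1 threshold splits; return (h_star, left, right, scatter_sum)."""
    values = sorted(cluster)
    best = None
    for cut in range(1, len(values)):
        left, right = values[:cut], values[cut:]
        total = scatter(left) + scatter(right)
        if best is None or total < best[3]:
            # Assumed convention: h* lies midway between the straddling samples.
            h_star = 0.5 * (values[cut - 1] + values[cut])
            best = (h_star, left, right, total)
    return best


h_star, left, right, total = best_split([601.0, 498.0, 503.0, 500.0, 599.0])
```

As written, each trial split recomputes both scatters from scratch;
running sums of h and h squared would avoid the repeated passes if speed
mattered.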

4. Nonhierarchical Splitting


With hierarchical splitting, once a boundary between two
clusters is established, it is never changed. This prevents the
procedure from finding the minimum-pooled-sample-variance solution when
there are three or more clusters. Improved results can be obtained by
readjusting all of the boundaries between clusters after each splitting.
Let $h_i^*$ be the boundary between clusters $C_i$ and $C_{i+1}$,
i = 1, ..., g - 1. Then the splitting procedure that we investigated can
be described as follows:

(1)  Initially, let g* = 1 and let $C_1$ be the entire data
     set.

(2)  Make trial splits to find the cluster $C_i$ for which
     $s_{iL}^2 + s_{iR}^2$ is minimum, where $s_{iL}^2$ is the sample
     scatter for the left half and $s_{iR}^2$ is the sample scatter for
     the right half of $C_i$. Let h* be the location of
     the boundary that best splits that cluster.

(3)  Increase g* by 1 and insert h* in the sequence of
     boundaries.

(4)  Starting at h* and working outward alternately to
     the left and right, adjust each boundary $h_i^*$ to best
     split $C_i \cup C_{i+1}$.

(5)  Repeat Step 4 until no further reduction in $\hat{\sigma}_p$ is
     obtained.

(6)  If g* = g, stop. Otherwise, return to Step 2.

Thus, with this procedure, all of the boundaries between
clusters are adjusted until no further pairwise improvement in
performance can be achieved. This still does not guarantee a
minimum-pooled-sample-variance solution, but it usually results in better
performance than can be obtained by hierarchical splitting.
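
The following sketch shows one way the splitting and boundary
readjustment could be organized for sorted univariate data. It departs
from the steps above in two respects, both assumptions: Step 2 is read as
choosing the trial split that gives the largest reduction in total
scatter, and the boundaries are swept left to right rather than outward
from the newest boundary. All names and data are illustrative.

```python
def scatter(values):
    """Sample scatter; an empty list contributes nothing."""
    if not values:
        return 0.0
    mean = sum(values) / len(values)
    return sum((h - mean) ** 2 for h in values)


def split_cost(values, lo, cut, hi):
    """Scatter sum when values[lo:hi] is cut into [lo:cut] and [cut:hi]."""
    return scatter(values[lo:cut]) + scatter(values[cut:hi])


def best_cut(values, lo, hi):
    """Cut index in (lo, hi) that minimizes the two-part scatter sum."""
    return min(range(lo + 1, hi), key=lambda c: split_cost(values, lo, c, hi))


def nonhierarchical_split(samples, g):
    values = sorted(samples)
    cuts = [0, len(values)]  # cluster i is values[cuts[i]:cuts[i + 1]]
    while len(cuts) - 1 < g:
        # Step 2 (assumed reading): trial-split every cluster, keep the best gain.
        best = None
        for i in range(len(cuts) - 1):
            lo, hi = cuts[i], cuts[i + 1]
            if hi - lo < 2:
                continue
            c = best_cut(values, lo, hi)
            gain = scatter(values[lo:hi]) - split_cost(values, lo, c, hi)
            if best is None or gain > best[0]:
                best = (gain, c)
        if best is None:
            break  # nothing left to split
        cuts = sorted(cuts + [best[1]])  # Step 3
        # Steps 4-5: readjust each boundary until no strict improvement remains.
        changed = True
        while changed:
            changed = False
            for i in range(1, len(cuts) - 1):
                lo, hi = cuts[i - 1], cuts[i + 1]
                c = best_cut(values, lo, hi)
                if split_cost(values, lo, c, hi) < split_cost(values, lo, cuts[i], hi) - 1e-9:
                    cuts[i] = c
                    changed = True
    return [values[cuts[i]:cuts[i + 1]] for i in range(len(cuts) - 1)]


# Hypothetical readings in meters (illustration only).
parts = nonhierarchical_split([500, 503, 498, 601, 599, 642, 640], g=3)
```

Accepting only strict improvements guarantees that the readjustment loop
terminates, since the total scatter decreases each time a boundary moves.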

5. Mode Seeking

Another popular class of clustering procedures is the
so-called mode-seeking procedures. They include the k-means method
(MacQueen, 1967), the Isodata method (Ball and Hall, 1967), and several
related methods (Rogers and Tanimoto, 1960; Sebestyen, 1966). All of
these methods can be viewed as simplified versions of Wolfe's procedure
for estimating the parameters of normal mixtures. The following is a
typical mode-seeking procedure:

(1)  Pick an initial set of g cluster centers
     $\bar{h}_1, \ldots, \bar{h}_g$.

(2)  Partition the n samples into g clusters, with $h \in C_i$
     if $|h - \bar{h}_i| \leq |h - \bar{h}_j|$ for j = 1 to g. Let $n_i$
     denote the number of samples in $C_i$.

(3)  Recompute the $\bar{h}_i$ by Eq. (39).

(4)  If the $\bar{h}_i$ are unchanged, stop. Otherwise, return to
     Step 2.

Compared to the hierarchical schemes described previously,
this procedure has the advantage that it works with a fixed number of
cluster centers. The iterative adjustment of the boundaries resulting
from adjusting the sample means is much simpler than the exhaustive
computation required by the nonhierarchical splitting method. However,
the results obtained depend strongly on the initial set of cluster
centers.
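
The four steps above can be sketched as follows. Leaving the center of an
empty cluster where it is and capping the number of passes are
assumptions not stated in the text; the names and readings are
illustrative.

```python
def mode_seek(samples, centers, max_iter=100):
    """Iterate Steps 2-4 from a given initial set of cluster centers."""
    centers = [float(c) for c in centers]
    clusters = []
    for _ in range(max_iter):  # assumed cap on the number of passes
        # Step 2: assign each sample to the nearest center.
        clusters = [[] for _ in centers]
        for h in samples:
            i = min(range(len(centers)), key=lambda j: abs(h - centers[j]))
            clusters[i].append(h)
        # Step 3: recompute the sample means (empty clusters keep their center).
        new_centers = [sum(c) / len(c) if c else centers[i]
                       for i, c in enumerate(clusters)]
        # Step 4: stop when nothing changes.
        if new_centers == centers:
            break
        centers = new_centers
    return centers, clusters


# Hypothetical readings in meters (illustration only).
data = [498, 500, 503, 599, 601, 640, 642]
good_centers, _ = mode_seek(data, [480, 590, 650])
poor_centers, poor_clusters = mode_seek(data, [450, 600, 700])
```

The first starting set recovers three groups near 500, 600, and 641
meters. The second leaves the center at 700 without any samples and lumps
the upper two groups together, which illustrates the dependence on the
initial centers.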


One of the best ways to find an initial set of cluster
centers is to use the results of hierarchical clustering or splitting
procedures. A simpler but poorer approach is to make a random selection
from the set of n samples. A third approach involves the use of a
parameter $\Delta h$, which is interpreted to mean the smallest allowed
distance between distinct cloud layers. Letting g* denote the number of
cluster centers selected at an intermediate stage, we can describe the
process of finding g centers as follows:

(1)  Initially set g* = 1, $\bar{h}_1 = h_1$, $n_1 = 1$, k = 1, and set
     $n_i = 0$ for i = 2 to g.

(2)  Increase k by one. If k > n, stop; otherwise read $h_k$.

(3)  Compute $e_i = h_k - \bar{h}_i$ for i = 1 to g*. Let $|e_m|$ be the
     smallest $|e_i|$.

(4)  If $|e_m| < \Delta h$ or if g* = g, increase $n_m$ by one, increase
     $\bar{h}_m$ by $e_m/n_m$, and return to Step 2.

(5)  Otherwise, increase g* by one, set $\bar{h}_{g^*} = h_k$, set
     $n_{g^*} = 1$, and return to Step 2.

Clearly, this in itself is another clustering procedure.
To describe it in words, it uses the first sample as a trial cluster
center. As long as successive readings are within $\Delta h$ of the first
cluster, they are just averaged in with previous readings. When a
reading more than $\Delta h$ away is encountered, it forms a second
cluster center. In general, a new cluster center is formed whenever a new
reading is more than $\Delta h$ away from all of the previous cluster
centers.

One of the main advantages of this procedure is that it
operates sequentially, and is well suited to processing data in real
time. One can readily think of variants of this approach, in which,
for example, cluster centers that have not been updated recently are
deleted, or in which the updating in Step 4 is modified to give more
weight to more recent readings. The chief disadvantage of this method
is its dependence on the parameter $\Delta h$. If $\Delta h$ is too small,
too many clusters will be formed, whereas if $\Delta h$ is too large the
opposite will be true. When we generate data with the model, we know the
value of $\Delta h$ that will work. One is not so lucky with real data,
and a valid evaluation of this method can only be done experimentally
without foreknowledge of the data.
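
The sequential procedure above lends itself to a short sketch. The
function name, the sample readings, and the value of $\Delta h$ used here
are hypothetical; the code assumes at least one reading is available.

```python
def sequential_centers(readings, g, delta_h):
    """Sequentially form at most g cluster centers from a stream of readings."""
    # Step 1: the first reading becomes the first trial center.
    centers = [float(readings[0])]
    counts = [1]
    for h in readings[1:]:                      # Step 2: read the next sample
        errors = [h - c for c in centers]       # Step 3: e_i = h_k - mean_i
        m = min(range(len(centers)), key=lambda i: abs(errors[i]))
        if abs(errors[m]) < delta_h or len(centers) == g:
            # Step 4: fold the reading into the nearest running mean.
            counts[m] += 1
            centers[m] += errors[m] / counts[m]
        else:
            # Step 5: the reading starts a new cluster center.
            centers.append(float(h))
            counts.append(1)
    return centers, counts


# Hypothetical readings in meters and an assumed 25-meter layer separation.
centers, counts = sequential_centers([500, 504, 600, 498, 640, 602],
                                     g=3, delta_h=25.0)
```

Because each reading is processed once and only the running means and
counts are stored, the procedure needs no memory of past readings, which
is what makes it attractive for real-time use.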

E.  Experimental Results

To compare the density estimation and clustering techniques,
a series of experiments was performed using data generated by the model
and measured ceiling-height data supplied to us by the National Weather
Service Test and Evaluation Laboratory. The two clustering methods used
were the nonhierarchical splitting method described in Section IV-D-4, and
the hierarchical clustering method described in Section IV-D-2. The former
was believed to represent an exhaustive method that should give the best
results, while the latter was believed to represent a computationally
simple method that would be an attractive candidate in practice.

1. Nonhierarchical Splitting

The nonhierarchical splitting procedure was programmed
and applied to the same data used to demonstrate the Parzen window and
$k_n$-nearest-neighbor procedures. This corresponded to a three-layer
problem with layers at 500, 600 and 640 meters, each layer having a
20-meter standard deviation. The n samples were the 180 consecutive
readings shown in Figure 13.

When the splitting procedure was applied, the results
shown in Figure 19 were obtained. As expected, the pooled standard
deviation $\hat{\sigma}_p$ decreased with the number of clusters. With
all the data grouped in one cluster, the sample mean was at 572 meters
and $\hat{\sigma}_p$ was 63 meters. With the data split into three
clusters, the sample means were at 501, 598, and 644 meters, and
$\hat{\sigma}_p$ was 16.3 meters. These sample means are quite close to
the true means, but the pooled sample standard deviation is
significantly less than the standard deviation for the individual layers;
this is a common result of partitioning, which transfers samples far
from the mean to another cluster. Of the 180 samples, 73 were in the
first cluster, 53 in the second, and 54 in the third. Thus, the sample
probabilities $\hat{P}_i$ for each layer were 0.406, 0.294, and 0.300.
The deviations from the true values of 0.333 for each layer are not
statistically significant.

While the results obtained by splitting the data into
three clusters are in good agreement with the model, in practice one
does not know a priori how many cloud layers exist. There are no
generally accepted criteria for determining the number of clusters in a
set of data. Wolfe suggests continuing the subdivision until a
$\chi^2$-test indicates that the reduction in pooled sample variance is
not statistically significant (Wolfe, 1970). Ball and Hall observe that
if a uniform probability density is divided into g clusters, then the
pooled sample standard deviation will be reduced by a factor of g, so
that one would expect at least this much improvement even in the absence
of any structure in the data (Ball and Hall, 1970). They suggest that the
optimum value for g is the one that minimizes $g\,\hat{\sigma}_p$.

This criterion fails with our data because the reduction
in $\hat{\sigma}_p$ in going from one cluster to two is so great that the
additional, significant improvement in going from two clusters to three
is not detected. However, the criterion can also be applied
incrementally by asking that the $\hat{\sigma}_p(g+1)$ obtained for g + 1
clusters be less than $g\,\hat{\sigma}_p(g)/(g+1)$, the value one would
obtain from a uniform density. Thus, to continue clustering,
$\hat{\sigma}_p(2)$ must be less than $\frac{1}{2}\hat{\sigma}_p(1)$,
$\hat{\sigma}_p(3)$ must be less than $\frac{2}{3}\hat{\sigma}_p(2)$, etc.
An empirically derived modification of this criterion is to ask that
$\hat{\sigma}_p(g+1)$ be less than $g\,\hat{\sigma}_p(g)/(g+0.8)$. This
latter criterion was successful in almost all of our experiments, and
is the one we recommend.

However, it should be pointed out that in practice one
might want to use other criteria, or to impose additional conditions.
For example, one might insist on splitting a cluster if the sample
standard deviation for samples in that cluster is too great. The simple
criterion "keep splitting clusters until $\hat{\sigma}_p$ is less than
20 meters" also worked well with our data. Since angular inaccuracies
will make ceilometer readings for high clouds more variable, one might
want to make any such threshold a function of height. We mention these
alternatives to point out that our study was not exhaustive, and further
modifications may be needed for an actual automatic system.
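
The incremental stopping rule can be sketched as below. The function
names are assumptions. In the example list, the values 63.0 and 16.3 are
the one-cluster and three-cluster results quoted above, while 24.0 and
14.5 are hypothetical values inserted only to make the example complete.

```python
def keep_splitting(sigma_g, sigma_next, g, offset=0.8):
    """True if sigma_p(g+1) < g * sigma_p(g) / (g + offset)."""
    return sigma_next < g * sigma_g / (g + offset)


def choose_num_clusters(sigmas, offset=0.8):
    """sigmas[i] is the pooled standard deviation for i + 1 clusters."""
    g = 1
    while g < len(sigmas) and keep_splitting(sigmas[g - 1], sigmas[g], g, offset):
        g += 1
    return g


# 63.0 and 16.3 come from the text; 24.0 and 14.5 are hypothetical fill-ins.
pooled_sigmas = [63.0, 24.0, 16.3, 14.5]
recommended = choose_num_clusters(pooled_sigmas)              # offset 0.8
uniform_rule = choose_num_clusters(pooled_sigmas, offset=1.0) # g/(g+1) rule
```

With these illustrative numbers the 0.8 rule accepts the step from two to
three clusters while the stricter g/(g+1) rule rejects it, which is the
kind of case the empirical modification is meant to handle.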

2. Hierarchical Clustering

The hierarchical clustering procedure described in Section
IV-D-2 was also programmed and applied to the model data. The results
of this experiment are shown in Figure 20. Considering the simplicity
of the technique, it is remarkable that the results are so similar to
those obtained by the more exhaustive nonhierarchical splitting procedure.
The pooled sample variance values are slightly higher and the sample
means are slightly different, with a distinctly different four-cluster
partitioning, but the overall results are essentially the same. The
following table summarizes numerical values for the three-cluster
solution.

                  Estimates                   True Values
   i      h_i (m)     n_i      P_i         h_i (m)      P_i

   1        501        73      .406          500        .333
   2        604        69      .383          600        .333
   3        650        38      .211          640        .333

As before, these results were obtained by clustering
all 180 samples. To determine the effect of the sample size, the
clustering procedure was repeated using the first 5, 10, 20, 40, 80, and
160 samples. Figure 21 shows that behavior stabilized when between 40 and
80 samples had been received. By the time 80 samples had been received,
all three layers had been firmly detected. This agrees with the results
of estimating the probability density function. It also makes sense with
reference to Figure 13, which shows that until the twenty-fourth sample
no data had been obtained from the lowest layer, and only four of the
first 40 samples came from this layer.

The hierarchical clustering procedure was also applied
to some measured ceiling-height data supplied to us by the National
Weather Service Test and Evaluation Laboratory (Lefkowitz, 1970). We
selected three data periods that seemed to show qualitatively different
characteristics. Figure 22 shows the data from period 50-69 for RBC-1,
a rotating-beam ceilometer with a 400-ft. baseline. Figure 23 shows
more variable data from period 49-69 for RBC-3, which had an 800-ft.
baseline. Figure 24 shows even more variable data from period 17-70
for RBC-1; only the primary returns, indicated by the circled data
points, were used, although one might also want to consider the
inclusion of secondary returns.

The results of clustering these data are shown in
Figure 25. Using the criterion that $\hat{\sigma}_p(g+1)$ had to be less
than $g\,\hat{\sigma}_p(g)/(g+0.8)$, we obtained the following results:

   Data Period    No. of Layers    h_i (ft.)    n_i      P_i

     50-69              1             799        65     1.000
     49-69              1            1370        45*     .693
     17-70              2             225        61      .642
                                     1542        34      .358

   * A total of 65 readings were made during the period.

FIGURE 21  EFFECT OF SAMPLE SIZE ON CLUSTERING PERFORMANCE

FIGURE 22  CEILING HEIGHT MEASUREMENTS FOR RBC-1, DATA PERIOD 50-69

FIGURE 23  CEILING HEIGHT MEASUREMENTS FOR RBC-3, DATA PERIOD 49-69

FIGURE 25  HIERARCHICAL CLUSTERING OF CEILING HEIGHT MEASUREMENTS

In the case of data period 49-69, the decision that
there was only one layer was a marginal one; if two layers had been
selected, they would have been located at 1165 ft. and 1429 ft. A
longer data record would be needed to make a firm choice between these
alternatives.

F.  Remarks 

This  investigation  of  density  estimation  and  clustering  pro¬ 
cedures  was  concerned  with  ways  of  processing  ceilometer  data  to  obtain 
information  about  the  number  of  cloud  layers,  the  average  height  of 
each  layer,  and  the  cloud  amount  in  layers.  The  problem  was  shown  to 
be  one  of  analyzing  a  mixture  distribution,  a  problem  that  has  no  com¬ 
putationally  simple  solution. 

A  simple  hierarchical  clustering  procedure  was  shown  to  be 
quite  effective  on  data  generated  by  the  model,  and  gave  reasonable 
results  for  a  small  sample  of  real  ceilometer  data.  Thus,  the  basic 
feasibility  of  this  kind  of  objective  data  processing  was  established. 
However,  a  few  problems  must  be  solved  before  the  technique  can  be  used 
routinely  in  practice.  These  include  the  following: 

(1)  Verification  that  the  criterion  for  determining  the 
number  of  layers  gives  the  desired  results. 

(2)  Determination  of  the  needed  number  of  samples  for  valid 
clustering.  The  number  needed  for  valid  clustering  may 
not  be  the  same  as  the  optimum  number  to  average  for 
cloud  amount  estimation. 

(3)  Possible  modification  of  the  procedure  to  ensure  stable 
results  as  a  function  of  time. 

The first two problems require fairly extensive
experimentation with actual ceilometer data, but are not expected to call
for any major changes in the procedure. The third problem is potentially
more difficult. Most of the clustering procedures ignore the fact that
the data arrive in time sequence, and that it might be desirable to find
and to track simple trends. Thus, in a period during which the cloud
base height is becoming steadily lower, a clustering procedure might
well describe the sky as containing two or more cloud layers. If the
time period for clustering is kept short enough to prevent this behavior,
there is the danger that the sky description will become erratic as
layers are seen, forgotten, and seen again. Serious problems of this
sort could require additional research directed at solving this
particular problem.

G .  Instrument  Limitations 

Although  the  cloud/instrument  model  can  simulate  both  the  sky 
conditions  and  the  physics  of  the  instruments,  the  physics  of  the  instru¬ 
ments  were  ignored  in  this  study.  In  general,  the  physical  limitations 
introduce  systematic  and  random  inaccuracies  that  reduce  the  performance 
that  can  be  achieved.  Three  classes  of  problems  can  be  distinguished: 

(1)  Measurement  accuracy 

(2)  Failure  to  detect  clouds 

(3)  Detection  of  false  signals  . 

Measurement accuracy for a rotating-beam ceilometer is
affected by several factors, such as the baseline distance, the cone
angles, and the levelling of the instrument. The problems of measuring
the heights of high clouds by triangulation are well known. In general,
an inaccuracy in measurement will introduce the same inaccuracy in the
resulting calculation of cloud base height, whether it is calculated by
averaging or by clustering.

Failure to detect clouds and the detection of false clouds
are both related to the problem of interpreting the waveform produced
by the instrument. The basic problem is one of detecting a signal in
the presence of noise. The physical design of the instrument can enhance
the signal by increasing the power of the transmitted signal, increasing
the sensitivity of the receiver, or modulating the signal to distinguish
it from noise. Similarly, the instrument design can minimize noise
through the use of filters, care to avoid vibration, etc. Nevertheless,
thermal noise, photon noise, and interference of various kinds are
always present, and any automatic system must be able to cope with
detection errors of both kinds.

Failure to detect a cloud that is present has obvious effects:
cloud amount estimates are too low, and cloud height estimates are more
uncertain than they need be. However, a small percentage detection
failure does not cause serious problems.

The "detection" of nonexistent clouds causes much more serious
problems. These false alarms are commonly due to electrical interference,
solar interference, or precipitation (e.g., the rainbow effect). To the
extent that these causes yield typical patterns, it should be possible
to program the automatic system to check for and eliminate them. For
example, interference might lead to clustering producing an abnormally
high number of clusters, or an abnormally high pooled sample standard
deviation. Rainfall leads to a spurious response at a height
corresponding to an elevation angle of 49° that a computer program can be
alert for. Snow leads to strong responses down to near zero elevation
angle. If temperature and rainfall information is also available to the
processing program, many of these false responses can be eliminated. This
suggests that the writing of the program for processing ceilometer data
should be done in conjunction with people who are intimately familiar
with the characteristics of the instrument, and have data illustrating
its behavior.

V  DETERMINATION OF INSTRUMENT CHARACTERISTICS

A.  System Configuration


There are several ways to use the theoretical results of
this report to determine instrument and data processing requirements
for an automatic system for cloud-amount and ceiling-height measurement.
These alternatives make it difficult to provide a summary that
is both simple and complete. In this section the theoretical results
are used to provide design curves for a system of multiple
vertically-pointing instruments that uses adaptive time averaging and
hierarchical clustering. While this is not the only kind of system to
which the theory could be applied, it is an important one that
illustrates the theory well.

The basic components of this system are illustrated in
Figure 26. The n ceilometers are synchronized vertically-pointing
instruments, each of which provides a ceiling-height reading every $\Delta T$
seconds. It is assumed that the output of each ceilometer is a voltage
that gives the detector response as a function of time, and that by
calibration, the relations between time, beam angle, and cloud height
are known. The multiplexer samples each instrument in turn, returning
to the first instrument in a small fraction of $\Delta T$.* It also samples
an anemometer that supplies wind speed values; these can be adjusted
by theoretical means to give the cloud speeds used in determining the
averaging time for cloud amount and cloud height determination. Cloud
speeds may also be obtained by inference from the wind data obtained
from upper air soundings.


* Consider a rotating-beam ceilometer that sweeps 180° in time $\Delta T$ and
has a baseline $a$. To obtain $P_h$ percent accuracy at height $h$, the
multiplexer must return in time ($\Delta T_m$)

$$\Delta T_m = \frac{P_h}{100\pi} \, \frac{h a}{h^2 + a^2} \, \Delta T .$$

For five percent accuracy at h = 5000 ft. with a = 400 ft., $\Delta T_m \approx$
0.0010 $\Delta T$.

The minicomputer has several tasks. For each instrument
it maintains a table (or, to be more exact, a circular list) of the
ceiling height readings for the last T seconds. It uses these tables
to compute, record, and display the results of averaging to obtain
cloud amount, and the results of clustering to determine the number of
layers, the cloud amount in layers, and the base height of each layer.
Each time ($\Delta T_m$) the minicomputer is interrupted by a ceilometer
reading, it checks that reading to see if a maximum response (presence
of clouds) has been obtained. Every $\Delta T$ seconds it updates the tables
for each instrument, replacing the oldest readings with the newest ones.
Each time it is interrupted by the anemometer, it updates the wind
speed record. This is corrected to estimate cloud speed and periodically
used to adjust the averaging time for cloud-amount and base-height
determination. Finally, the minicomputer may be called upon to provide
permanent records and summary statistics as desired. These can be
listed on the teletype and/or punched on paper tape for processing elsewhere.
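The circular list of readings and the hit-counting estimate of cloud amount can be sketched as follows. This is only an illustration under stated assumptions: the class name `CeilometerTable`, the use of `None` to mark a sweep with no cloud return, and the sample readings are all hypothetical and are not taken from the report.

```python
from collections import deque


class CeilometerTable:
    """Circular list of one instrument's readings over the last T seconds."""

    def __init__(self, averaging_time_s, sampling_interval_s):
        # One reading arrives every sampling interval, so the window
        # holds T / Delta-T readings.
        self.readings = deque(maxlen=int(averaging_time_s // sampling_interval_s))

    def update(self, height_ft):
        # height_ft is None when the sweep produced no cloud return
        # (an assumed convention). Once the deque is full, appending
        # discards the oldest reading automatically.
        self.readings.append(height_ft)

    def cloud_amount(self):
        # Fraction of sweeps in the window that produced a cloud hit.
        if not self.readings:
            return 0.0
        hits = sum(1 for h in self.readings if h is not None)
        return hits / len(self.readings)


# Hypothetical window of 10 s sampled every 2 s, so 5 readings are kept.
table = CeilometerTable(averaging_time_s=10, sampling_interval_s=2)
for reading in [1000, None, 1100, None, None, 1050, 1200]:
    table.update(reading)
print(list(table.readings), table.cloud_amount())
```

A fixed-length deque keeps the update cost constant per reading, which matches the idea of replacing the oldest readings with the newest ones every sampling interval.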

It should be emphasized that this report is concerned with
theoretical requirements, not system design, and other system
configurations should be considered. In particular, the computer hardware
requirements can be very sensitive to the display and output
requirements, the procedures for servicing interrupts, and the way that the
computational programs are coded. However, the theory does provide
absolute minimum computational requirements needed to achieve a
specified level of performance.

B. Specifications, Characteristics, and Parameters

The  theory  developed  in  this  report  relates  three  sets  of 
quantities:  performance  specifications,  instrument  and  processing 
parameters, and cloud parameters. These are listed explicitly below:

(1) Performance specifications

    (a) Root-mean-square error in estimating cloud amount        $\sigma_c$
    (b) Root-mean-square error in estimating cloud base height   $\sigma_h$
    (c) Area to be covered                                       $A$

(2) Instrument and processing requirements

    (a) Number of instruments                                    $n$
    (b) Instrument configuration                                 -
    (c) Sampling interval                                        $\Delta T$
    (d) Averaging time                                           $T$
    (e) Data processing requirements                             -

(3) Cloud parameters

    (a) Mean cloud length                                        $\ell_m$
    (b) Cloud amount                                             $c_a$
    (c) Cloud speed                                              $v$
    (d) Mean base height                                         $h$
    (e) Base-height standard deviation                           $\sigma_b$
    (f) Base-height correlation distance                         $d$

In the remainder of this section we will show how the performance
specifications and the cloud parameters together determine
the instrument and processing requirements. Although we restrict
our attention to the specific system illustrated in Figure 26, the more
general applicability of our results should be clear. We will show how
different decisions regarding performance specifications and cloud
parameters lead to different instrument and processing requirements.
We now consider each of these requirements in turn.

C.  Number  of  Instruments 

If no time averaging is employed, the number (n) of instruments
needed to achieve a specified error in estimating cloud amount
($\sigma_c$) depends on the cloud amount ($c_a$), the area (A), and the mean
cloud length ($\ell_m$). We will only consider the worst case, where $c_a$ =
0.5, keeping in mind that better performance will be achieved if $c_a$
is near zero or one. Figure 6 shows the relation between n, $\sigma_c$ ($\sigma$ on
this figure), and $\sqrt{A}/\ell_m$. In general, this figure indicates the need
for either a large number of instruments or time averaging.

To simplify the consideration of time averaging, we assume
that the cloud speed (v) can be measured and used to determine the
averaging time. Although Eq. (18) indicates the existence of an optimum
averaging time, it can call for a long period if v is small. This
can prevent the system from being able to respond to sudden changes in
conditions. To accommodate this problem, we introduce a new parameter,
the maximum allowed averaging time $T_{max}$. We assume that the actual
averaging time is computed according to

$$T = \frac{\sqrt{A}}{v} \quad \text{if } v > \frac{\sqrt{A}}{T_{max}}$$

or

$$T = T_{max} \quad \text{if } v \le \frac{\sqrt{A}}{T_{max}} , \qquad (45)$$

so that the averaging time never exceeds $T_{max}$.
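The rule of Eq. (45) can be written as a short function. This is a minimal sketch: the function name, argument names, and the numerical inputs in the usage lines are illustrative assumptions, with speed in mi/hr, area in square miles, and time in hours.

```python
import math


def averaging_time(v_mph, area_sq_mi, t_max_hr):
    """Averaging time T in hours: sqrt(A)/v, capped at T_max."""
    side = math.sqrt(area_sq_mi)  # sqrt(A), in miles
    if v_mph > side / t_max_hr:
        # Fast clouds cross the area in less than T_max.
        return side / v_mph
    # Slow (or stationary) clouds: fall back to the cap.
    return t_max_hr


# Hypothetical inputs: sqrt(A) = 0.34 mi and T_max = 0.1 hr.
print(averaging_time(5.0, 0.34 ** 2, 0.1))  # about 0.068 hr
print(averaging_time(2.0, 0.34 ** 2, 0.1))  # 0.1 hr, the cap
```

Writing the test as `v > sqrt(A)/T_max` rather than dividing by `v` first means a cloud speed of zero is handled without a special case.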

The choice of a value for $T_{max}$ is basically a trade-off
between the desired time of response and cost for a given performance.
This in turn involves how rapidly ceiling conditions can deteriorate
at a given airport and how important this is. Too short a $T_{max}$
may yield unrepresentative results, or may require an increased number
of instruments to achieve representative results. Our graphs show the
effects of different choices for $T_{max}$ on instrument requirements; in
our examples we use $T_{max}$ = 0.1 hr., which seems to be a reasonable
choice.


Figure 7 shows how averaging reduces the RMS error $\sigma_m(n)$.
Note that the case T = 0 corresponds to no averaging, and the resulting
RMS error is exactly the $\sigma$ of Figure 6. With the use of time averaging,
$\sigma_m(n) = \sigma_c$, the specified RMS error. For simplicity, in the range
$vT/\sqrt{A} \le 1$, we approximate the curves in Figure 7 by


The procedure for finding the number of instruments is then
straightforward, and involves the following steps:

(1) Select a value for the maximum allowed RMS error $\sigma_c$.

(2) Select a value for the maximum allowed averaging time $T_{max}$.

(3) Select a value for the area A to be covered.

(4) Determine the minimum cloud speed v and the minimum
    cloud length $\ell_m$ of interest at the particular area.

(5) Compute $vT_{max}$.

(6) If $vT_{max} > \sqrt{A}$, let $\sigma = 2\sigma_c$; otherwise use the formula

    $$\sigma = \frac{2\sigma_c}{1 + \left(1 - [\text{illegible}]\right)} \qquad (47)$$

(7) Use this $\sigma$ and the value of $\sqrt{A}/\ell_m$ to obtain n from
    Figure 6.

To illustrate the results of this computation, a series of
design curves was obtained in this way. A value of 0.15 was selected
for $\sigma_c$, since this corresponds fairly well to the performance of human
observers (Galligan, 1953). Ranges of values were used for the other
parameters, leading to the results shown in Figure 27. For a specific
case, consider an 800 ft. by 4000 ft. area ($\sqrt{A}$ = 0.34 mi.) located in
a region where time averaging should be limited to $T_{max}$ = 0.1 hr. and
clouds as small as $\ell_m$ = 0.05 mi. are important. Thus $vT_{max}$ = 0.5 mi.,
and for $\sqrt{A}$ = 0.34 mi. Figure 27(b) shows that for this case two
instruments would be adequate. From these curves one can see how changes in
any of the parameters other than $\sigma_c$ influence this conclusion. The
formulas allow similar curves to be obtained for any desired RMS error
in cloud amount $\sigma_c$.
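The arithmetic behind this specific case can be checked with a short script. It is a sketch only: the function name `figure6_inputs` is hypothetical, the cloud speed of 5 mi/hr is not stated in the text but is implied by $vT_{max}$ = 0.5 mi. with $T_{max}$ = 0.1 hr., and only the branch $vT_{max} > \sqrt{A}$ of step (6) is handled. The final lookup of n in Figure 6 is graphical and is not reproduced.

```python
import math

FT_PER_MILE = 5280.0


def figure6_inputs(sigma_c, t_max_hr, area_sq_mi, v_mph, l_m_mi):
    """Return sqrt(A), v*T_max, sigma, and sqrt(A)/l_m for the Figure 6 lookup."""
    root_a = math.sqrt(area_sq_mi)
    travel = v_mph * t_max_hr          # step (5): distance moved in T_max
    if travel > root_a:
        sigma = 2.0 * sigma_c          # step (6), first branch
    else:
        sigma = None                   # Eq. (47) applies; not implemented here
    return root_a, travel, sigma, root_a / l_m_mi


area = (800 / FT_PER_MILE) * (4000 / FT_PER_MILE)   # square miles
root_a, travel, sigma, ratio = figure6_inputs(0.15, 0.1, area, 5.0, 0.05)
print(round(root_a, 2), travel, sigma, round(ratio, 1))  # 0.34 0.5 0.3 6.8
```

With these inputs the lookup in Figure 6 would be made at $\sigma$ = 0.30 and $\sqrt{A}/\ell_m$ of about 6.8.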

D.  Instrument  Configuration 

All of these theoretical results were obtained under the
assumption that the instruments are located as far from one another
and the boundary of the area as possible. This had the purpose of
obtaining readings from each instrument that were as independent as
possible. Furthermore, for reasonably large values of n, this made the
behavior of the system insensitive to the cloud direction. Particular
configurations that meet these requirements for a square area are shown
in Figure 4.

In practice, various constraints usually prevent the use of
an optimum configuration. In addition, climatological considerations
may dictate a nonisotropic configuration to take advantage of common
weather patterns. For example, suppose that it is known that at a
particular location the clouds almost always come in from the west.
Then the instruments should be arranged equally spaced in a line along
the west edge of the area. Although the theory does not specify
compromise solutions, a general rule is that the instruments should be
located to obtain readings that are as independent as possible, as
quickly as possible.

FIGURE 27  NUMBER OF INSTRUMENTS REQUIRED
(a) $\ell_m$ = 0.02 miles; (b) $\ell_m$ = 0.05 miles; (c) $\ell_m$ = 0.1 miles;
(d) $\ell_m$ = 0.2 miles; (e) $\ell_m$ = 0.5 miles

E. Sampling Interval

The sampling interval $\Delta T$ affects the accuracy with which the
cloud amount and the mean cloud base height can be determined. In Section
11-02 it is shown that little additional accuracy can be obtained
if the sampling interval $\Delta T$ is smaller than $0.25\,\ell_m/v$ or $0.5\,d/v$, where
$\ell_m$ is the mean cloud length, v is the cloud speed, and d is the
base-height correlation distance. In addition, of course, $\Delta T$ must be smaller
than the averaging time T, which in turn is less than $\sqrt{A}/v$. Thus, $\Delta T$
is fixed by the requirement

$$\Delta T = \frac{\min\{0.25\,\ell_m,\; 0.5\,d,\; \sqrt{A}\}}{v} .$$


To determine numerical values for $\Delta T$, one must have knowledge
of the worst possible conditions at which specified performance is to be
achieved. Specifically, this means knowing the shortest mean cloud
length, highest cloud speed, and shortest correlation distance (which
measures how rapidly a given base height fluctuates). Figures 28 and
29 show how the required sampling interval varies with these parameters.
As expected, the requirements are most severe when the cloud speed is
large and when any of the lengths $\ell_m$, d, or $\sqrt{A}$ are small. In most cases
of interest the area A is not the limiting factor. Lack of data on the
correlation distance makes it difficult to decide whether or not it is
more significant than $\ell_m$ in determining $\Delta T$. If d is less than $0.5\,\ell_m$,
then d is the more important factor. That is certainly the case for
large clouds, but it may not be true for the smallest clouds of interest.
In any case, once the largest value of v and the smallest values of
$\ell_m$, d, and $\sqrt{A}$ are selected, $\Delta T$ can be determined from Figures 28 and
29.
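In place of reading Figures 28 and 29, the three limits named in this section ($0.25\,\ell_m/v$, $0.5\,d/v$, and $\sqrt{A}/v$) can be compared directly. The sketch below takes the smallest of them as the sampling interval and reports which length is limiting. The function name and all numerical inputs are hypothetical worst-case values chosen only for illustration.

```python
import math


def sampling_interval_hr(v_mph, l_m_mi, d_mi, area_sq_mi):
    """Return (Delta-T in hours, name of the limiting length)."""
    limits = {
        "cloud length": 0.25 * l_m_mi,
        "correlation distance": 0.5 * d_mi,
        "area": math.sqrt(area_sq_mi),
    }
    limiting = min(limits, key=limits.get)   # smallest effective length
    return limits[limiting] / v_mph, limiting


# Hypothetical worst case: fast clouds, short clouds, short correlation distance.
dt_hr, limiting = sampling_interval_hr(
    v_mph=30.0, l_m_mi=0.05, d_mi=0.1, area_sq_mi=0.34 ** 2
)
print(round(dt_hr * 3600, 2), "seconds; limited by", limiting)
```

For these inputs d is not less than $0.5\,\ell_m$, so the mean cloud length is the limiting factor, in line with the comparison made in the text.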

F.  Averaging  Time 

Eq. (45) gives the averaging time for the determination of
cloud amount. In addition, an averaging time is needed for determining
the cloud base height, and these two times need not be the same. This
latter time is determined primarily by the specified RMS error $\sigma_h$ in
estimating the cloud base height, or, alternatively, by the percentage
error in estimating the cloud base height ($P_h$)

$$P_h = 100\,\frac{\sigma_h}{h} , \qquad (49)$$

where $\sigma_h$ is the RMS error in estimating cloud base height and h is the
mean base height.

For simplicity, we consider only the single-layer case in
determining the averaging time. Appendix E contains a derivation of
the RMS error for a single instrument. For n widely separated
instruments the corresponding result is

$$\sigma_h = \frac{\sigma_b}{\sqrt{nvT/2d}} \left[ 1 - \frac{1 - e^{-vT/d}}{vT/d} \right]^{1/2} , \qquad (50)$$

where $\sigma_b$ is the base-height standard deviation, v is the cloud speed,
T is the averaging time, d is the base-height correlation distance,
and n is the number of instruments.


This result can be simplified further by assuming that $\sigma_b$
is proportional to h. To give specific results, we used the 10-minute
data in Table VII of Davis (1969) to obtain the approximate relation

$$\sigma_b \approx 0.1\,h .$$

Figure 30 shows the resulting relation between $P_h$ and vT/d
for different values of n when $\sigma_b$ = 0.1h; other values of $\sigma_b$ would
yield other curves. By specifying $P_h$ and n, one can determine vT/d,
and hence T. Note that as in the case of cloud amount estimation, T
will vary with v and will be large if v is small. Once again it is
necessary to set a maximum averaging time $T_{max}$ to allow the system to
respond to sudden changes. When v is so small that T reaches this
maximum, Figure 30 can be used to obtain the resulting percentage
error $P_h$.

Example:

Suppose n = 4 and a percentage error of no more than $P_h$ = 3%
is desired. From Figure 30, vT/d = 4.2. Thus, if this performance is
desired for v as small as 5 mi/hr and d as large as 0.5 mile, T must
be at least 0.42 hr. If T cannot exceed $T_{max}$ = 0.1 hr, we cannot
achieve 3% error with four instruments. However, under these same
conditions, an averaging time T of 0.1 hours gives vT/d = 1, and our
graphs show that the percentage error ($P_h$) will be 4.3%.
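The two figures quoted in this example can be reproduced numerically instead of being read from Figure 30. The sketch below assumes that Eq. (50) has the form $\sigma_h = \sigma_b \sqrt{(2/nx)\,[1 - (1 - e^{-x})/x]}$ with $x = vT/d$, and that $\sigma_b = 0.1h$. That form is a reading of the printed equation, adopted here because it returns the quoted 3% and 4.3%. The function name is hypothetical.

```python
import math


def percent_error(n, x, sigma_b_over_h=0.1):
    """Percentage error P_h for n instruments, with x = v*T/d."""
    bracket = 1.0 - (1.0 - math.exp(-x)) / x
    sigma_h_over_h = sigma_b_over_h * math.sqrt(2.0 * bracket / (n * x))
    return 100.0 * sigma_h_over_h


print(round(percent_error(4, 4.2), 1))  # 3.0
print(round(percent_error(4, 1.0), 1))  # 4.3

# Averaging time needed for vT/d = 4.2 with v = 5 mi/hr and d = 0.5 mi.
print(4.2 * 0.5 / 5.0)  # about 0.42 hr
```

Because the error depends on v, T, and d only through vT/d, the same function can be used to see how far a capped averaging time falls short of a specified $P_h$.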

In  general,  the  reader  should  be  cautioned  against  setting 
specifications  that  call  for  extreme  accuracy  in  determining  the  mean 
base  height.  If  the  mean  height  is  1000  ft.  and  the  standard  deviation 
is  100  ft.,  it  is  rather  meaningless  to  ask,  say,  for  one  percent 
accuracy  in  determining  the  mean.  While  in  theory  this  accuracy  can 
be  achieved,  it  is  obtained  at  the  cost  of  excessively  long  averaging 
times, and from an operational viewpoint the 100 ft. fluctuations reduce
the usefulness of knowing the mean to within 10 ft.

It  should  also  be  noted  that  we  have  used  the  mean  height  h 
of  the  clouds  in  a  given  layer  to  define  the  height  of  that  layer.  For 
aviation  applications,  it  may  be  desirable  to  modify  this  definition. 

For example, if the effective base height is defined as $h - 2\sigma_b$, then
the  instantaneous  cloud  height  will  be  below  the  effective  base  height 
less  than  five  percent  of  the  time.  Such  alternative  definitions  do 
not  affect  any  of  the  results  we  have  presented,  since  they  merely  make 
changes  in  the  way  that  the  effective  base  height  is  obtained  from  the 
results  of  clustering. 
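The "less than five percent" figure can be checked under one added assumption that the report does not state here: that instantaneous base heights are normally distributed about the mean with standard deviation $\sigma_b$. The mean of 1000 ft. and standard deviation of 100 ft. are the values used in the preceding caution about accuracy; the function name is hypothetical.

```python
import math


def fraction_below(k):
    """P(Z < -k) for a standard normal variable Z (assumed distribution)."""
    return 0.5 * (1.0 + math.erf(-k / math.sqrt(2.0)))


mean_h, sigma_b = 1000.0, 100.0          # feet
effective_base = mean_h - 2.0 * sigma_b  # effective base height, h - 2*sigma_b
print(effective_base, fraction_below(2.0))
```

Under the normal assumption the fraction is a little over two percent, which is consistent with the bound of five percent given in the text. A different height distribution would give a different fraction.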

G.  Computational  Requirements 

The two basic computations to be performed by the minicomputer
are the cloud-amount calculation and the cloud-base-height
calculation. The former is much simpler than the latter, involving
merely the counting of the number of hits obtained during the averaging
period. This can be done in little more than the time required to
service the interrupts from the multiplexer.

PERCENTAGE ERROR FOR CLOUD BASE HEIGHT ESTIMATION


Were it not for the possibility of multiple cloud layers,
the cloud-base-height calculation would also be simple, involving
merely the averaging of the ceilometer readings obtained during the
averaging period. However, when multiple layers are possible and
clustering is required, the computational requirements can rise
significantly. Our study of clustering indicated that simple hierarchical
clustering can be very effective, but it also exposed some questions
(discussed in Section IV-F) that should be answered before the exact
computational procedure can be established. Despite this problem,
this simple procedure can be used to establish minimum computational
requirements.

A particular version of the modified hierarchical clustering
procedure described in Section IV-D-2 and used in Section IV-E-2 is
analyzed in Appendix F. This procedure analyzes ceilometer readings
to give the number of cloud layers, the mean height of each layer, and
the cloud amount in layers. The minicomputer used to implement this
procedure is assumed to require time $t_{add}$ for an addition or subtraction
and time $t_{mul}$ for a multiplication or a division. The results of the
analysis lead to the following minimum computer requirements:

    High-speed memory:  $\dfrac{3nT}{\Delta T}$ words

    Operation speed:  $7t_{add} + t_{mul} \le \dfrac{\Delta T^2}{n^2 T}$

Here n is the number of instruments, T is the averaging time, and $\Delta T$ is
the sampling interval.

For example, suppose that to meet other performance specifications
we must have n = 4 instruments, an averaging time as long as
T = 0.1 hr, and a sampling interval of $\Delta T$ = 2 sec. Then the minicomputer
must have at least 2160 words of high-speed memory, and must take
no longer than 694 microseconds to perform seven additions and a
multiplication.
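These two numbers can be reproduced with a few lines of arithmetic. The sketch assumes the memory requirement is $3nT/\Delta T$ words and the time allowed for seven additions and one multiplication is $\Delta T^2/(n^2 T)$. The second expression is a reading of the printed formula, adopted because it returns the quoted 694 microseconds. The function name is hypothetical.

```python
def minimum_requirements(n, averaging_time_s, sampling_interval_s):
    """Return (memory in words, time budget in seconds for 7 adds + 1 multiply)."""
    memory_words = 3 * n * averaging_time_s / sampling_interval_s
    time_budget_s = sampling_interval_s ** 2 / (n ** 2 * averaging_time_s)
    return memory_words, time_budget_s


# n = 4 instruments, T = 0.1 hr = 360 s, Delta-T = 2 s.
words, budget = minimum_requirements(n=4, averaging_time_s=360.0, sampling_interval_s=2.0)
print(round(words), round(budget * 1e6))  # 2160 words, 694 microseconds
```

Doubling the number of instruments doubles the memory requirement but cuts the time budget to a quarter, which is why the speed requirement is the more severe of the two.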

These results show that the computer requirements grow
rapidly with the ratio $nT/\Delta T$, the total number of measurements taken
during the averaging period. Of the two, the speed requirements are
more severe, since they grow approximately as the square of that ratio.
Figure 31 shows the speed requirements graphically.

Again,  it  should  be  emphasized  that  these  requirements 
have  been  derived  for  a  particular  data  processing  procedure  coded  in 
a  particular  way.  Although  they  should  serve  as  a  useful  guide  for 
minimum  requirements,  the  limitations  of  this  analysis  should  be 
remembered . 

By directly relating the specific climatology of a given
airport and its environs to the parameters of the model described in
this study, one could specify the exact system requirements for
automatically determining the cloud amount and base height at that
airport. Such a system could be an advantage at those airports where
manpower is at a minimum.

VI  COST COMPARISON

A. Introduction

An automated system for measuring cloud amount and ceiling
height has the potential advantage of making, recording, and
communicating objective observations frequently and reliably. However, the
cost for such a system can be quite high, particularly when a large
number of instruments must be used. This section compares the cost of
an automated system using n instruments with the cost of the present
manual system. For simplicity, our comparison involves costs only, and
tacitly assumes that the systems being compared provide results of equal
value or benefit. The question of benefit must also be considered in
the eventual selection of the method to be used.

The  results  of  the  cost  analysis  depend  upon  the  assumptions 
used  for  the  various  costs  involved.  The  Federal  Aviation  Administration 
has  provided  salary,  installation,  and  maintenance  support  costs,  and 
values  for  the  time  involved  in  making  weather  observations .  It  should 
be  noted  that  these  data  and  other  data  used  in  the  following  analysis 
do  not  necessarily  reflect  the  experience  or  mission  of  the  National 
Weather  Service. 

In  the  course  of  the  study,  it  was  found  that  the  frequency 
of  taking  readings  was  the  most  important  factor  affecting  the  least- 
cost choice. The current method provides one report per hour at each
airport,  with  more  frequent  reports  in  critical  situations.  Assuming 
that  more  frequent  reports  might  be  required  merely  by  the  projected 
future  volume  of  aviation  operations,  we  also  considered  manual  methods 
that  would  provide  more  frequent  reports  routinely. 

It is important to note that under certain conditions the
observer may have to make a number of instrument readings to obtain
one recorded report. Thus, we have given the cost comparison in terms of
the number of instruments and the frequency of making readings. The
reader should be cautioned that the number of readings per hour is
generally greater than the number of reports per hour.

B. Scope of the Comparison


Current methods of obtaining ceiling information are assumed
to consist of a person operating a rotating beam ceilometer, performing
calculations, making visual observations, exercising judgment, and then
recording results. The cost for obtaining measurements is thus assumed
to consist of the cost of the ceilometer plus the cost of a person's
time for the time involved. It also is assumed that the persons
involved are provided cost-free facilities by the larger establishment
(e.g., the airport weather office) and that this larger establishment
can absorb usefully the remaining fraction of the observer's time
(nonintegral number of persons) left over after obtaining ceiling
measurements. In short, only the cost of the ceilometer and the
fractional man-hour cost of the observations (cost of a possibly
nonintegral number of persons) is included in the comparison.

For the automated system there is assumed to be no cost for
operating personnel. The equipment is turned on by the cost-free
larger establishment and the equipment then produces results.

For both methods there is no cost included for what is done
with the results. The manual system has done its job when it has
produced handwritten data on a sheet of paper. For the automated
system, the job is done when a punched tape or teletypewriter printout
has been produced.

For both methods the costs included are:

(1) Amortization of original equipment cost

(2) Amortization of initial spare-parts inventory

(3) Labor involved in maintenance of equipment

(4) Parts costs in maintenance of equipment

(5) Amortization of equipment installation costs.

For both methods the costs not included are:

(1) The overall management function

(2) Utilities

(3) Office space

(4) Training

(5) Initial engineering studies.

Labor and equipment costs are as of 1971, and no inflation is assumed.

With the exception of the ceilometer, the equipment to be
used has not yet been selected, and therefore it has been necessary to
arrive at costs as a function of rough definitions. For this reason
the use of judgment is necessary. To better bracket this judgment,
high, medium, and low equipment costs have been used.

In the case of personnel costs and utilization, it is also
not clear as to the exact amounts to be considered. The personnel cost
given by the Federal Aviation Administration appears to be salary only,
whereas there is usually about 100 percent overhead and/or payroll
burden on most salaries. Also, the personnel utilization factor given
by the Federal Aviation Administration is not definitive. For these
reasons, it has been found necessary to use judgment. As will be seen
below, the judgment is bracketed by using high, medium, and low cost
estimates.

C.  Cost  Assumptions  and  Data  for  the  Current  Manual  Method 

Though  one  can  assume  that  there  already  is  a  rotating  beam 
ceilometer  at  each  airport,  it  is  also  true  that  they  will  eventually 
wear  out  or  become  obsolete.  Therefore,  it  was  decided  to  amortize 
the  ceilometer  to  obtain  the  most  equitable  comparison. 

1. Cost and Installation

From Weather Measure Corporation (undated), the purchase
cost of the ceilometer is $20,000. However, its installation cost will
vary. Weather Measure Corporation (undated) gives evidence to support
an average of $2,000 for installation. In keeping with our plan to
provide a spectrum of cost estimates, $1,000, $2,000, and $3,000 have
been used as low, medium, and high installation costs respectively.

2. Initial Spare-Parts Inventory

For electronic equipment, an often used and usually
reliable factor for the cost of initial spare parts is 33 percent of
original cost. For the low estimate this cost was left out, and for
the high estimate it was doubled.

3. Maintenance

Maintenance of the ceilometer consists of purchased labor
plus replacements to the stock of spare parts. Values of 10 percent,
20 percent, and 30 percent of the initial cost per year have been used as
historically valid factors for low, medium, and high maintenance costs,
respectively.

4. Personnel

The Federal Aviation Administration gives $11,700 per
year as the cost of a man, and states that considerably less than 1/5
of his utilization is for the ceiling measurement function. The
$11,700 does not, but ought to, include overhead. One would be
inclined to double the $11,700 to include overhead, and multiply the
1/5 by 2/3 or 1/2; this assumes that the time required for a weather
observation is about 1/5 of the observer's time, and about two-thirds
or one-half of this is used in making the ceiling observation. We
have chosen the latter, and arrive at $11,700 × 2 × 1/5 × 1/2 ≈ $2,400 as
the medium manpower cost for obtaining and recording ceiling measurements
hourly. Other combinations of overhead rate and utilization multiplier
could arrive at the same figure. For low and high costs, we use half
and one-and-a-half of this amount, respectively, resulting in $1,200,
$2,400, and $3,600 per reading per hour per year.

When  readings  are  to  be  taken  more  frequently  than  hourly, 
we  assume  that  the  one  ceilometer  is  adequate  but  that  the  manpower  must 
increase  in  direct  proportion  to  the  number  of  readings  per  hour.  The 
resulting  cost  spectrum  for  the  current  manual  method  is  given  in 
Table  1. 
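The entries of Table 1 follow from the assumptions of this section and can be regenerated with a short script. This is a sketch, not the authors' procedure: B is taken to be the $20,000 purchase cost and E the installation cost, an interpretation of the table's formulas that reproduces its five-year ceilometer costs of 31, 48, and 66 thousand dollars. The names `SCENARIOS` and `five_year_cost` are hypothetical.

```python
PURCHASE = 20.0  # B: ceilometer purchase cost, thousands of dollars

SCENARIOS = {
    # name: (spares multiplier on B, installation E,
    #        yearly maintenance fraction of B, labor cost per reading/hr/yr)
    "low": (1.00, 1.0, 0.10, 1.2),
    "medium": (1.33, 2.0, 0.20, 2.4),
    "high": (1.66, 3.0, 0.30, 3.6),
}


def five_year_cost(scenario, readings_per_hour, years=5):
    """Return (labor cost P, ceilometer cost Q, total P + Q) in thousands of dollars."""
    spares, install, maint, labor = SCENARIOS[scenario]
    ceilometer = spares * PURCHASE + install + years * maint * PURCHASE  # column Q
    readings = readings_per_hour * labor * years                         # column P
    return readings, ceilometer, readings + ceilometer


for name in SCENARIOS:
    for k in (1, 3, 5, 6, 12, 60):
        p, q, total = five_year_cost(name, k)
        print(f"{name:6s} K={k:2d}  P={p:7.1f}  Q={q:5.1f}  P+Q={total:7.1f}")
```

The medium and high ceilometer costs come out as 48.6 and 66.2; the table lists them without the fractional part. Labor dominates the total as soon as readings are taken more than a few times per hour.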

D.  Assumptions  and  Data  for  the  Proposed  Automated  Method 
1.  Cost  and  Installation 

The  proposed  automated  method  would  use  from  one  to  six 
ceilometers,  and  all  of  these  possibilities  are  analyzed  herein.  The 
cost  assumptions  for  the  ceilometer  are  the  same  as  stated  in  the 
previous  section. 

The ceilometers are activated by and give analog results
to a "multiplexer," which, in turn, is driven by a minicomputer. The
multiplexer gives digital information to the computer. Our multiplexer
cost estimates were guided by the multiplexer
described in Weather Measure Corporation (undated) and by the $500-$1,000
cost of analog-to-digital conversion when there is no multiplexing
function. Various multiplexers are on the market costing from $2,000
to $15,000, the variation being due to many things, including the
multiplex number. For low, medium, and high costs for the multiplexer
we have assumed an initial cost of $500, $1,000, and $2,000 per
ceilometer, respectively.

Table 1

COST TABLES AND FORMULAS FOR THE CURRENT MANUAL METHOD

Column definitions:

  K       Readings per hour
  L       Time between readings (min.)
  M*      Cost per reading per hour per year (labor)
  N*      K × M: yearly cost of readings (labor)
  P*      N × 5: five-year cost of readings (labor)
  Q*      Five-year cost of one ceilometer (see page 102):
            Lo    1.00 B + E + 5 × .1 B
            Med   1.33 B + E + 5 × .2 B
            Hi    1.66 B + E + 5 × .3 B
  P + Q   Five-year cost

                  K      L      M        N          P      Q     P + Q

  Low Cost        1     60    1.2      1.2        6.0     31        37
                  3     20    1.2      3.6       18.0     31        49
                  5     12    1.2      6.0       30.0     31        61
                  6     10    1.2      7.2       36.0     31        67
                 12      5    1.2     14.4       72.0     31       103
                 60      1    1.2     72.0      360.0     31       391

  Medium Cost     1     60    2.4      2.4       12.0     48        60
                  3     20    2.4      7.2       36.0     48        84
                  5     12    2.4     12.0       60.0     48       108
                  6     10    2.4     14.4       72.0     48       120
                 12      5    2.4     28.8      144.0     48       192
                 60      1    2.4    144.0      720.0     48       768

  High Cost       1     60    3.6      3.6       18.0     66        84
                  3     20    3.6     10.8       54.0     66       120
                  5     12    3.6     18.0       90.0     66       156
                  6     10    3.6     21.6      108.0     66       174
                 12      5    3.6     43.2      216.0     66       282
                 60      1    3.6    216.0    1,080.0     66     1,146

  * In thousands of dollars

The  minicomputer  maintains  the  clock  function  and  drives 
the  multiplexer.  It  then  obtains  digital  information,  and  computes 
and states results. An examination of the relative costs showed a
minicomputer to be much less costly than the possible use of a
shared-time computer. This is especially true as the desired readout
frequency increases. Minicomputer costs run from $5,000 to in excess of
$25,000. Those costs in excess of $25,000 are usually for complex output
devices.
It  is  assumed  here,  however,  that  the  output  is  a  punched  paper  tape 
and/or  a  teletype  printout.  Values  of  $10,000,  $15,000,  and  $25,000 
have  been  assumed  to  indicate  low,  medium,  and  high  costs,  respectively. 

2. Initial Spare Parts and Maintenance

For the multiplexer and the minicomputer, the same
assumptions are made as for the ceilometer concerning cost of initial
parts, stocks, and maintenance costs. It is assumed that each item of
equipment has an average expected life of 10 years.

3. Personnel

No personnel costs are involved. The cost spectrum for
the proposed automated method is given in Table 2.

E.  Findings 

Figures 32, 33, and 34 show the numerical results in graphic
form as obtained from Tables 1 and 2. The diagonal lines on these
figures represent the five-year cost of ceiling readings using the
present method. Near the right-hand edge of each graph, at 60 minutes
between readings, the current costs can be read. Low, medium, and high
are about 37, 60, and 84 thousand dollars, respectively, as can be
verified in Table 1. If readings are to be taken at intervals shorter
than 60 minutes, then more manpower will be required. Moving left
across the graph, as the reading interval decreases the cost increases.
On the graphs it can be seen that if readings are to be taken every two
minutes with the present method, then the five-year costs (low, medium,
and high, respectively) are 210, 410, and 610 thousand dollars. This
represents about a fivefold increase in cost. Using the diagonal line,
costs can be read for the present method for any reading interval
down to one minute.

FIGURE 32  COST OF MANUAL AND AUTOMATED SYSTEMS: LOW COST CASE
FIGURE 33  COST OF MANUAL AND AUTOMATED SYSTEMS: MEDIUM COST CASE
FIGURE 34  COST OF MANUAL AND AUTOMATED SYSTEMS: HIGH COST CASE

Table 2

COST TABLES AND FORMULAS FOR THE PROPOSED AUTOMATED METHOD

The automated systems have the capability of giving readings
at any desired interval longer than about 15 seconds. Thus, on the graphs
they lead to horizontal lines. Their costs are fixed by the number of
ceilometers that they use, and can be read directly on the graphs. The
medium costs for one- through six-ceilometer systems are, respectively,
75, 120, 165, 210, 255, and 300 thousand dollars (as can be verified in
Table 2), regardless of the time interval between readings. Because
they use more equipment, at 60-minute reading intervals they are always
more costly than the present method; but if readings are to be taken
every two minutes, then all of the automated methods cost less than the
manual method.
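
The crossover between the diagonal (manual) lines and the horizontal (automated) lines can also be estimated numerically. The following sketch is an illustration under stated assumptions rather than the procedure used to draw the figures. It takes the medium-cost labor rate M and ceilometer cost Q from Table 1 and the medium five-year automated costs A quoted above, and solves Q + 5KM = A for the number of readings per hour K at which the two methods cost the same. All names are hypothetical.

```python
# Illustrative break-even estimate for the medium-cost case.
# All amounts are in thousands of dollars; names are hypothetical.
# Manual five-year cost:    Q + 5 * K * M, with K readings per hour (Table 1).
# Automated five-year cost: fixed by the number of ceilometers (quoted in text).

M_MEDIUM = 2.4    # labor cost per reading per hour per year
Q_MEDIUM = 48.0   # five-year cost of the single ceilometer used manually
AUTOMATED_MEDIUM = {1: 75.0, 2: 120.0, 3: 165.0, 4: 210.0, 5: 255.0, 6: 300.0}


def break_even_interval_minutes(n_ceilometers):
    """Reading interval at which manual and automated five-year costs are equal."""
    automated = AUTOMATED_MEDIUM[n_ceilometers]
    readings_per_hour = (automated - Q_MEDIUM) / (5 * M_MEDIUM)
    return 60.0 / readings_per_hour


for n in sorted(AUTOMATED_MEDIUM):
    minutes = break_even_interval_minutes(n)
    print(f"{n} ceilometer(s): break-even near {minutes:.1f} min between readings")
```

Under these assumptions the break-even interval runs from roughly 27 minutes for a one-ceilometer system down to roughly 3 minutes for a six-ceilometer system; at shorter intervals the automated system is the cheaper of the two.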

Thus,  using  the  graphs  it  is  possible  in  a  number  of  ways  to 
compare  the  present  method  to  the  proposed  method.  Some  examples  follow. 

Suppose that it has been decided a priori that the reading
interval ought to be five minutes. Then Table 3 can be obtained from
the graphs.

Table 3

COST COMPARISON WITH READING INTERVAL OF FIVE MINUTES
(5-Year Cost, thousands of dollars)

                             Number of Ceilometers in Automatic System
Cost Estimate    Present
                 Method        1      2      3      4      5      6

Low cost           102         42     70     98    127    ...    183
Medium cost        195         75    120    165    210    255    300
High cost          273        131    196    261    325    390    455

From Table 3 it can be seen that for a 5-minute interval airports using
1, 2, or 3 ceilometers with the automated method would achieve lower
cost than will airports with the present method. But the opposite would
be true for airports using 4, 5, or 6 ceilometers with the automated
method.

Another way of using the charts would be to hypothesize a
situation in which a maximum of $250,000 were available for each airport
and it would be used to minimize the time interval between readings.
Table 4 shows the results of such a comparison.

Table 4

SPEND UP TO $250,000 AT EACH AIRPORT AND MINIMIZE
THE TIME INTERVAL BETWEEN READINGS

                 Present method            Proposed method
Cost Estimate    ($250,000 is spent)       (time interval <1 minute)
                 time interval achieved

Low cost         1.67 min.                 all airports <$250,000
Medium cost      3.5 min.                  1-4 ceilometer airports <$250,000
High cost        5.2 min.                  1-2 ceilometer airports <$250,000

In Table 4 it can be seen that airports requiring 1 or 2
ceilometers in the automated method would spend less money and achieve
better results than the present methods. For airports requiring more
than 2 ceilometers for the automated method a more careful cost study
is needed before the decision could be made.

A  number  of  other  interesting  relationships  can  be  found  in 
the  graphs,  but  the  one  proposed  here  as  the  most  meaningful  is  shown 
in  Table  5.  This  table  was  constructed  from  the  graphical  findings  to 
show  those  reading  intervals  for  which  one  method  is  superior  to  the 
other,  regardless  of  cost  uncertainty.  Note  in  Table  5  that  for 
reading  intervals  greater  than  45  minutes  the  current  manual  method  is 
superior  (costs  less).  For  reading  intervals  of  2  minutes  or  less  the 
automated  method  is  superior,  even  at  6-ceilometer  airports.  In  between 
those  limits  the  choice  is  a  function  of  the  number  of  ceilometers 
required  by  the  automated  system.  When  the  number  of  ceilometers  is 
known, then the uncertainty band, due to cost uncertainty, is small.

Table 5

COMPARISON OF MANUAL AND AUTOMATED SYSTEMS,
REGARDLESS OF COST UNCERTAINTY

A = Proposed automated better than current
    manual regardless of cost uncertainty.

M = Current manual better than proposed
    automated regardless of cost uncertainty.

U = Uncertain.

The overwhelming importance of the reading interval is clear.
If it is not to be reduced, then present systems are adequate. If it
is to be reduced, then the proposed automated system becomes increasingly
desirable.

VII  CONCLUSIONS


The  research  effort  described  in  this  report  was  a  theoretical 
study  of  the  problem  of  automatically  describing  cloud  amount  and  cloud 
base  height.  The  study  was  based  on  the  analysis  and  computer  simula¬ 
tion  of  a  theoretical  cloud  model.  To  the  extent  that  this  model  gives 
an  adequate  representation  of  actual  cloud  conditions,  this  study 
demonstrated  the  theoretical  feasibility  of  automation.  It  was  shown 
that  cloud  amount  estimates  more  accurate  than  those  of  human  observers 
can  be  obtained  by  properly  averaging  the  responses  of  an  appropriate 
number  of  conventional  vertically-pointing  instruments.  It  was  also 
shown  that  clustering  techniques  can  determine  the  number  of  cloud 
layers,  the  mean  height  of  each  layer,  and  the  cloud  amount  in  layers. 

The  definitions  of  cloud  amount  and  cloud  base  height  used 
in  this  work  corresponded  to  the  way  these  quantities  were  defined  in 
the  cloud  model.  Thus,  the  cloud  amount  in  a  given  layer  was  defined 
as  the  fraction  of  the  ground  area  covered  by  clouds  in  that  layer  and 
not  covered  by  clouds  in  lower  layers.  The  cloud  base  height  for  a 
given  layer  was  defined  as  the  average  height  of  the  base  of  the  clouds 
in  that  layer.  When  the  distance  between  two  layers  is  small  relative 
to  the  standard  deviation  for  the  cloud  base,  the  clustering  procedures 
may  not  be  able  to  resolve  the  two  layers.  In  such  cases,  it  may 
ultimately  be  preferable  to  adopt  the  operational  definition  that  the 
number  of  layers  and  the  mean  base  heights  for  each  layer  are  defined 
by  the  results  of  clustering.  However,  this  is  not  recommended  until 
more  experience  with  the  characteristics  of  the  clustering  procedures 
has  been  obtained. 

The  number  and  characteristics  of  the  instruments  required 
for  an  automatic  system  depend  on  the  performance  specifications.  A 
number  of  theoretically  and  experimentally  determined  relations  between 
the  accuracy,  number  of  instruments,  sampling  interval,  area  covered, 
cloud  parameters,  and  averaging  time  were  presented.  Design  curves  were 
given  to  show  how  these  relations  can  be  used  to  establish  some  of  the 
more important instrument and computational requirements. This report
did not address the question of how the performance specifications
should be determined, or the question of how the cloud parameter data
should be obtained. These are important topics that must be treated
with care in the design of an actual automated system.

A simple cost comparison of the present manual and possible
automatic systems was made to address the question of economic
feasibility. In making a comparison, it was assumed that the systems gave
equivalent  performances,  and  other  possible  benefits  of  either  system 
were  not  considered.  The  results  were  found  to  depend  heavily  on  the 
frequency  with  which  instruments  must  be  read  and  the  number  of  instru¬ 
ments  required  for  an  automatic  system.  If  a  frequency  of  one  reading 
per  hour  is  satisfactory,  then  the  manual  system  is  definitely  less 
expensive.  The  crossover  point  occurs  in  the  frequency  range  of  six  to 
twelve  readings  per  hour,  the  exact  results  depending  on  the  number  of 
instruments  required  and  the  effects  of  cost  uncertainties. 


Appendix  A 

CLOUD  DETECTION  BY  SCANNING  MICROWAVE  RADARS 
by  R.  H.  Blackmer,  Jr. 

1. Introduction

Microwave radars operate in the frequency range between 100 MHz
and 100 GHz. Within this frequency range, many meteorological targets
including clouds are detectable. Interest in cloud detection led to the
design, testing, and operational use of radars specifically for cloud
detection. Because of the nature of cloud cover, the end result to date
has been radar sets with short wavelengths, say 0.86 cm, a transmitting
and receiver antenna oriented in a fixed position directed toward the
zenith, and a short pulse length, narrow beam, and low power output.

While radars with the above characteristics have evolved as the
optimum systems to meet old requirements, i.e., as an instrumental aid
to human observers, they are not necessarily the optimum answer in a
fully automated system. Parallel to the development of these "radar
ceilometers," there have been developments in other types of radar
systems for other purposes that could aid or even replace the radar
ceilometers, depending on various trade-offs in cost and type of data
desired.

If  three-dimensional  data  on  cloud  cover  at  frequent  intervals  are 
desired,  then  a  narrow,  fixed  vertical  beam  is  obviously  not  acceptable, 
and  a  scanning  antenna  system  must  be  employed.  The  problems  of  attempt¬ 
ing  to  scan  with  one  of  the  radars  designed  as  a  vertically  pointing 
cloud  ceilometer  were  discussed  at  length  by  Blackmer  and  Ligda  (1963). 
Cogent  points  from  their  report  will  be  presented  later  in  this  report. 

If accurate cloud base measurements (especially when low clouds
prevail) are desired for airport operations, the types of currently
available radars capable of scanning in three dimensions may not suffice
since they have some severe limitations in the detection of clouds at
short ranges and low elevation angles. These systems and their
limitations will be discussed later.

The problem then is to:

(1) Specify the operational requirements,

(2) Examine the characteristics of current equipment to see how
    well it meets these requirements, and

(3) Make recommendations for either new equipment or new use of
    old equipment to result in a system that will meet operational
    requirements.

This Appendix will touch briefly on Items 1 and 3 and will consider
Item 2 in detail.

2. Operational Requirements

Since the objective is an automated system to provide cloud amount
and height over an area the size of an airport (or larger), some sort
of scanning instrumentation is required. To meet this general
requirement it is necessary to scan the volume rapidly enough so that
clouds do not change substantially during the period required and to feed
the results of the scan into some computation system that can convert the
frequency of target detection into percent of cloud cover at various
levels. Further, it will be necessary to eliminate noncloud targets
that are radar-detectable. Depending on the characteristics of the
radar, such targets as insects, birds, aircraft, refractive index
gradients, chaff, lightning, balloons, smoke, dust, and ground targets in
sidelobes may be detected. A radar capable of detecting all clouds
would undoubtedly detect many of the above-listed noncloud targets, so
there must be some trade-offs between the amount of cloud detected and
the number of noncloud targets detected.

3. Cloud Measurement Using Current Radars

One of the more widely used radar ceilometers is the AN/TPQ-11
vertically pointing radar. Table A-1 lists the characteristics of this
radar together with the characteristics of radars used for long range
precipitation detection or for research purposes.

The table shows that the Wallops Island radar, except for the
longer wavelengths, is in some respects superior to the TPQ-11, e.g.,
the transmitted power is much higher, the antenna is larger, and the
minimum detectable signal is lower. The longer pulse length would, of
course, be a detriment to the accurate determination of cloud base height
when the beam was pointed vertically.

All the radars listed except the AN/TPQ-11 scan either horizontally
or vertically or can be operated as a vertically pointing instrument.
The AN/TPQ-11 is designed only for operation in the fixed, vertically
pointing position.

The  AN/CPS-9,  because  of  its  relatively  low  power  output,  and  the 
WSR-57,  because  of  its  long  wavelength,  do  not  detect  many  clouds  until 
the  particles  within  them  become  quite  large.  The  relatively  broad 
beamwidths  of  these  radars  would  make  it  impossible  to  measure  accurately 
the  altitudes  of  clouds,  and  low  clouds  would  merge  with  ground  targets. 

The  Wallops  Island  Radar  has  been  used  widely  in  the  study  of  clear 
air  echoes,  but  little  has  been  published  about  the  detection  of  clouds  by 
this  radar.  In  some  instances,  however,  cirrus  cloud  has  been  present 
or  convection  has  proceeded  to  the  point  that  visible  convective  clouds 
were  forming.  On  these  occasions  the  cloud  cover  presents  a  solid  target 
that  is  distinguishable  from  noncloud  targets. 

During periods when the atmosphere is visually clear the radar may
show a number of layers of refractive index discontinuity between the
surface and the tropopause. The level of the tropopause itself is often
evident in the radar return. In illustrated examples of the clear air
radar returns, presented by Mather and Hardy (1970), the problem of
detection of ground targets and aircraft is obvious. Detection of these
targets in sidelobes of the beam results in the presentation over a
broad altitude range when the antenna is scanned in the vertical. The
same would be true on a horizontal scan, so without some method of
eliminating these noncloud returns these "super-sensitive" radars would
not be useful as part of an automatic cloud-reporting system.

The  possibility  of  scanning  these  sensitive  radars  at  reduced  gain 
for  cloud  detection  could  be  considered,  but  instead  of  operating  these 
radars in such a manner it might be more economical to build a less
sensitive radar for cloud detection. However, in the event that such
radars were brought into widespread use for such tasks as detection of
clear air turbulence, for example, joint use at reduced sensitivity for
cloud detection might be feasible. This possibility is considered in more
detail in the next section.

4. Modification of Present Radar Systems

Atlas (1968) discusses in detail methods of clear air turbulence
detection. He concludes that sensitive radars, because of their size and
cost, would be of questionable economic feasibility unless they can be
time-shared for other purposes. However, he states that a radar suitable
for clear air turbulence detection "would also be an effective
cloud base and top indicator as well." In his discussion of adapting
surplus military radars for clear air turbulence, he estimates a
modification cost of $39,000 per unit. Presumably this is with a fixed,
vertically pointing antenna, and a mount for a rotatable antenna would add
to the expense. Such expense would probably be justified, since scanning
would probably be necessary to give the required areal coverage.

Blackmer  and  Ligda  (1963)  made  a  study  of  the  increased  cloud 
detection  possible  with  a  radar  with  a  fixed  scanning  angle.  The  angle 
chosen  by  considering  the  size  of  the  vertical  component  of  the  pulse 
volume  was  53  degrees  from  the  vertical.  A  scanning  rate  of  one  revolu¬ 
tion  in  12  minutes  was  chosen  on  the  basis  of  the  radar  data  recording 
system.  At  this  scan  rate  one  degree  is  10  recorder  sweeps.  With  this 
angle  the  altitude  at  which  clouds  are  detected  varies  from  the  surface 
at the antenna to 60,000 feet at 13 nautical miles. Clouds around 1,000
feet would be detected at a distance of two nautical miles. Such
operation would not fulfill the requirement for a scanning radar covering
all altitude intervals over an airport. A series of elevation angles would
be required to cover a broad altitude range.
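
The upper altitude figure and the sweep rate follow from simple geometry. As a rough check, the sketch below assumes a flat earth, a straight beam, and that the quoted distance is measured along the ground; these are assumptions of the sketch, not statements from Blackmer and Ligda. With the beam tilted 53 degrees from the vertical, the beam-axis height at ground distance d is d / tan(53 degrees). The function names and the unit-conversion constant are illustrative.

```python
import math

FEET_PER_NAUTICAL_MILE = 6076.12  # international nautical mile expressed in feet


def beam_height_ft(ground_distance_nmi, angle_from_vertical_deg=53.0):
    """Height of the beam axis above the antenna (flat-earth approximation)."""
    ground_distance_ft = ground_distance_nmi * FEET_PER_NAUTICAL_MILE
    return ground_distance_ft / math.tan(math.radians(angle_from_vertical_deg))


def sweeps_per_second(revolution_minutes=12.0, sweeps_per_degree=10.0):
    """Recorder sweep rate implied by the scan rate quoted in the text."""
    degrees_per_second = 360.0 / (revolution_minutes * 60.0)
    return degrees_per_second * sweeps_per_degree


print(round(beam_height_ft(13.0)))  # compare with the 60,000 ft quoted at 13 n. mi.
print(sweeps_per_second())          # sweeps per second at one revolution per 12 min
```

With these assumptions the beam axis is at about 59,500 feet at 13 nautical miles, in line with the 60,000 feet quoted, and one revolution in 12 minutes at 10 sweeps per degree corresponds to 5 recorder sweeps per second.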

At  the  fixed  altitude  scan  of  53  degrees  the  presentation  of  the 
clouds  on  the  display  scope  suffered  serious  distortion.  The  report  by 
Blackmer  and  Ligda  gives  several  examples  of  what  the  cloud  cover  would 
look  like  when  scanned  vertically  and  would  look  like  when  scanned  at 
53  degrees.  Scanning  would  obviously  require  some  type  of  constant 
altitude  presentation  to  make  order  from  the  chaos  that  scanning  at  a 
single angle would give. Data processing of scans at multiple angles would
be very complex even with only a simple "yes" or "no" as to whether or
not there was echo at a given slant range on each scan. To quantify
echo intensity at a number of ranges at a number of elevation angles
and to convert these numbers to percent of cloud cover at various heights
would be very complicated. The problem arises when the beam intercepts
the base of the clouds at one range and the sides of the clouds at a
greater range when there is one layer of clouds. When there are multiple
layers, one can never be sure whether the beam is intercepting the base
of the clouds with a single scan angle. Multiple scan angles must be
rapid enough so that clouds do not move or change much between scans so
that a given cloud element can be identified on subsequent scans.
Modification of present radar ceilometers for rapid scanning over a sector
containing an airport should be considered. Scanning a sector would be
simpler than a 360-degree scan and, if only the lowest clouds were of
interest, the radar could be located so that the range of elevation angles
required would be limited. There would be some trade-off required between
distance from the airport and range of elevation angles. If the radar
was close to the airport, the range of elevation angles required would be
greater. If the radar was far from the airport, the range of elevations
would be smaller, but the greater distance from radar to clouds might
reduce the chances of detecting some clouds.

Present  weather  radars  could  undoubtedly  be  modified  in  the  manner 
Atlas  suggested  for  surplus  military  radars  at  a  cost  of  about  $40,000 
each.  Better  still,  current  ultrasensitive  cloud  radars  could  be  used 
to study clouds, a determination made of how much the sensitivity could
be reduced, and a new radar built incorporating the results of the study.

5. Summation

There are in use today fixed vertically pointing radar ceilometers
and precipitation-detecting scanning weather radars. Neither of these
two types of equipment can provide three-dimensional information on
cloud distribution over an airport either for aviation purposes or for
general meteorology. Either type of equipment could be modified: the
radar ceilometer by devising some type of rotatable antenna mount, although
the limitations in cloud detection capability would still exist; the
weather radars by adding larger antennas, parametric amplifiers, and
an integrator.

In addition to the two types of operational radars, there are
ultrasensitive research radars used for the study of clear air phenomena.
Such radars can also detect clouds but would probably be too expensive
for widespread operational use as cloud detection sets. Such radars
could serve, however, as prototype units: studies could be made of how
much the sensitivity could be reduced, and from such studies a new
scanning cloud radar could be designed and built.

Whatever  radar  is  finally  used  will  require  elaborate  auxiliary 
equipment  to  filter  out  birds,  insects,  clear  air  returns,  etc.,  and 
convert  the  received  signal  into  useful  cloud  information.  Design  of 
such  equipment  may  be  more  difficult  than  design  of  the  radar  and  will 
certainly  require  extensive  study. 


Appendix B

LIDAR MEASUREMENTS OF CLOUD BASE
by R. T. H. Collis

1. Introduction

The use of the radar principle to measure cloud base with pulsed
light was first employed in France in 1939 (Bureau, 1946). Developments
of this approach using flash lamp light sources have been used routinely
for many years (Perlat and Petit, 1961). The advent of the pulsed laser
in the early 1960's, however, marked the beginning of a series of major
improvements that are still in progress. Initially using giant-pulsed
"Q-switched" ruby lasers, a number of workers demonstrated the facility
with which "lidar," as the technique has become known, could make
observations of clouds and measure the heights of their base (Collis, 1965;
Andermo et al., 1965).

Although these developments were largely concerned with research,
either into the lidar technique or with atmospheric phenomena (e.g.,
Collis et al., 1968), some attempts have been made to develop lidars for
use in routine cloud base measurement (ASEA, 1968; Bird and Rider, 1968)
and to explore their use in this role (Viezee et al., 1969). The problems
involved  in  this  application  were  discussed  further  by  Collis  (1969),  who 
gave  particular  note  to  safety  considerations.  The  safety  problem  as 
well  as  various  technical  factors  has  led  to  the  exploration  of  lasers 
operating  outside  the  visual  range,  and  low-peak-power  systems  using 
solid-state  lasers  are  under  development  (e.g.,  Sperry  Rand,  1971). 

Although there can hardly be any doubt that the lidar technique
of cloud observation is preeminent for research applications,
considerable problems remain to be solved before its full potential can be
realized in practical operational systems intended for untended routine
use at meteorological observing stations.

2. Basic Lidar Concept

In its basic form, lidar employs a laser simply as a source of
pulsed energy. In typical high-peak-power systems, Q-switched ruby or
neodymium-doped glass lasers are used to generate light pulses having
peak powers of tens of megawatts and durations of 10-29 nanoseconds.
These pulses are directed in highly collimated beams by suitable optical
systems. Energy backscattered by the atmosphere is detected by a
photomultiplier after collection by a lens or reflector system. The
resulting signal is evaluated as a function of time from the transmission
of the pulse. It is typically displayed on an oscilloscope, either directly
or after storage in a magnetic-disc video recorder. Polaroid or other
photographs may be used to record the displayed data.

Taking advantage of the monochromatic nature of the laser energy,
narrow-band optical filters may be used in the receiver system to
minimize "noise" caused by extraneous light and thus make daytime use
readily possible.

The nature of the received signal, $P_r$, is given by the equation:

$$P_r(R) = P_t \, L \, \beta(R) \, A_r \, R^{-2} \exp\left[-2\int_0^R \sigma(r)\,dr\right] \qquad \text{(B-1)}$$

where

$P_r$ is instantaneous received power
$P_t$ is transmitted power
$L$ is effective pulse length
    ($L = c\tau/2$, where $c$ is the velocity of
    light and $\tau$ is pulse duration; it is
    the range interval from which signals
    are simultaneously received at time $t$.)
$\beta$ is the volume backscattering coefficient
    of the atmosphere
$R$ is range ($R = c(t - t_0)/2$, where $t_0$ is the
    time of transmission of pulse)
$\sigma$ is the volume extinction coefficient, and
$A_r$ is the effective receiver aperture.

The magnitudes of $\beta$ and $\sigma$ depend upon the wavelength of the
incident energy, and the number, size, shape, and refractive properties of
the illuminated particles per unit volume. These range from the
hydrometeors of precipitation, cloud, or fog, through the particulate
components of the "clear" aerosol, to the molecules of the gaseous
atmosphere.

In relatively clear atmospheres both the backscattering coefficient
and the extinction coefficient are small, and increases in backscattering
as a function of range can readily be interpreted. For
example, the presence of a sudden increase in scattering due to a cloud
layer is obvious and unequivocal. In strongly scattering conditions,
however, marked attenuation occurs, and in observing a cloud, for
example, the lidar return rapidly diminishes as the penetration of the
cloud increases. The evaluation of lidar signals in such circumstances
thus presents certain difficulties and imposes the need for care in
making qualitative interpretations. Typical values of $\beta$ and $\sigma$
are indicated in Figure B-1.
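
To illustrate how Equation (B-1) produces both the sharp rise at cloud base and the rapid decay inside the cloud, the following sketch evaluates the equation for a two-layer atmosphere. It is a minimal illustration under assumed conditions: the backscattering and extinction values, the cloud-base height, the receiver aperture, and the 20-nanosecond pulse are hypothetical round numbers chosen for the example, not values taken from Figure B-1, and the integral is approximated by the trapezoidal rule. All names are hypothetical.

```python
import math

C_LIGHT = 3.0e8  # velocity of light, m/s (rounded)


def lidar_return(ranges_m, beta, sigma, p_t=1.0, tau_s=20e-9, aperture_m2=0.01):
    """Evaluate P_r(R) = P_t L beta(R) A_r R^-2 exp(-2 * integral_0^R sigma dr).

    ranges_m: increasing range gates R (m); beta, sigma: per-gate coefficients.
    """
    pulse_len_m = C_LIGHT * tau_s / 2.0    # L = c * tau / 2
    received = []
    optical_depth = 0.0                     # running integral of sigma from 0 to R
    prev_r, prev_sigma = 0.0, sigma[0]
    for r, b, s in zip(ranges_m, beta, sigma):
        optical_depth += 0.5 * (prev_sigma + s) * (r - prev_r)  # trapezoidal step
        received.append(p_t * pulse_len_m * b * aperture_m2 * r ** -2
                        * math.exp(-2.0 * optical_depth))
        prev_r, prev_sigma = r, s
    return received


# Hypothetical profile: clear air below an assumed cloud base at 1000 m.
CLOUD_BASE_M = 1000.0
ranges = [10.0 * i for i in range(1, 201)]                     # 10 m gates to 2000 m
beta = [1e-3 if r >= CLOUD_BASE_M else 1e-6 for r in ranges]   # backscatter, 1/(m sr)
sigma = [2e-2 if r >= CLOUD_BASE_M else 1e-4 for r in ranges]  # extinction, 1/m
power = lidar_return(ranges, beta, sigma)

base_gate = ranges.index(CLOUD_BASE_M)
print(power[base_gate] / power[base_gate - 1])    # jump in return at cloud base
print(power[base_gate + 20] / power[base_gate])   # decay 200 m into the cloud
```

With these assumed values the return rises by more than two orders of magnitude at the cloud base and falls by more than three orders of magnitude within the first 200 m of cloud, which is the behavior described in the paragraph above.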

A  single  lidar  observation  provides  information  on  atmospheric 
scattering  in  a  specified  direction  in  terms  of  scattering  intensity  as 
a function of range. Series of such observations may be made by scanning
at angular intervals within a plane. If this plane is vertical, a
cross section may be derived that corresponds to a radar RHI (Range Height
Indicator) display. By making successive observations in the zenith at
a  single  location,  a  time/height  display  can  be  generated  representing  a 
form  of  vertical  section  parallel  to  the  wind  direction,  or  showing  how 
the  vertical  conditions  change  with  time. 

3. Realization of Practical Cloud-Base-Measuring Systems

A.  General 

The application of the lidar concept in a practical operational
form for cloud-base measurement involves two major technical problems and
the ever-present problem of achieving technical solutions at an acceptable
cost. The first technical problem is concerned with the acquisition of
the basic raw data, and is essentially an engineering problem in which
performance must be balanced against the time factor, complexity, safety
requirements, etc. The second technical problem is the recovery of the
wanted information from the raw data. The interrelations of these
problems and their further involvement with the economics of the situation
present a very complicated task to anyone considering how best to meet
operational requirements that have themselves not been fully defined.
FIGURE B-1 VOLUME BACKSCATTERING AND ATTENUATION COEFFICIENTS
CALCULATED FOR RUBY LIDAR WAVELENGTHS (0.7 µ)
The individual aspects of the problems are now considered separately,
from which will be apparent the main uncertainties that must be resolved
if lidar is to play a role in making routine cloud base measurements.
In general, as would be expected, the solution appears the more feasible
as the requirement becomes simpler. Difficulties grow rapidly as the fuller
realization of lidar's potential is attempted.

B.  Data  Acquisition 

The acquisition of signals of adequate intensity for evaluation
depends upon the optical properties of the clouds and the intervening
"clear" atmosphere, and upon the characteristics of the lidar.
In the simplest case, where a well-defined cloud base is encountered
above clear conditions with good visibility, the problem is simply one
of optimizing the engineering variables so that the energy transmitted
is sufficient to produce a signal larger than the minimum detectable
signal from the range in question. The interrelation of the engineering
variables will be apparent from a consideration of Eq. (B-1). The matter
is discussed in greater detail by Northend et al. (1966) (see also Sperry
Rand, 1971). With high-peak-power pulsed ruby or neodymium lasers,
sufficient performance can readily be obtained with a single pulse.

With such lasers, however, the question of eye safety arises. The
immediate solution would be to use lasers that operate well outside
the visual hazard range, i.e., at wavelengths longer than 1.4 microns
(Wyman, 1969). Another solution is to reduce the energy transmitted
in a single pulse and use a series of less intense pulses, integrating
the returned signals to achieve detectable levels. The ultimate in
this approach is to use very-low-intensity transmitted pulses from
solid-state lasers. For example, gallium arsenide lasers operating
at 0.91 micron wavelength can produce useful power levels at repetition
rates of the order of 1,000 pulses per second. In the Sperry Rand GaAs
lidar system, in fact, pulses of 1,000 watts peak power are used at
repetition rates of 500 pps. With these, integration of as few as five
pulses is adequate to yield workable signal levels from typical cloud
bases under optimum conditions.
Intermediate solutions could employ neodymium YAG lasers
(1.06 micron wavelength) operating at low to moderate peak powers at
repetition rates of some tens per second.

Other steps can also be taken to reduce hazards further.
Transmitted beam divergence can be increased to reduce energy density
levels without serious loss of resolution (beam widths of the order of
1 or 2 degrees would be quite adequate, sampling areas some 50-100 ft
in diameter at an altitude of 3,000 ft). However, this approach leads
to an increase of the minimum detectable signal level, because of the greater
background noise that will result from increasing the receiver beam
divergence to correspond to that of the transmitter. Again, the peak
power can be reduced by increasing the pulse length. Only a small
benefit can be gained in this way before range resolution is seriously
affected.
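As a rough check on the sampled-area figures, a beam of full angular width θ pointed at the zenith illuminates a circle of diameter 2h·tan(θ/2) at height h. The sketch below assumes a simple conical beam with a sharp edge; the function name `footprint_diameter_ft` is illustrative and does not come from the report.

```python
import math

def footprint_diameter_ft(beam_width_deg: float, altitude_ft: float) -> float:
    """Diameter of the circle illuminated at a given altitude by a
    zenith-pointing conical beam of the given full angular width."""
    half_angle = math.radians(beam_width_deg) / 2.0
    return 2.0 * altitude_ft * math.tan(half_angle)

if __name__ == "__main__":
    for width_deg in (1.0, 2.0):
        diameter = footprint_diameter_ft(width_deg, 3000.0)
        print(f"{width_deg:.0f} deg beam at 3,000 ft: about {diameter:.0f} ft across")
```

Under this geometry, beam widths of 1 and 2 degrees sample areas roughly 52 ft and 105 ft across at 3,000 ft.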

On the question of range resolution, Q-switched lasers typically
produce pulse lengths of 20 nanoseconds or so, i.e., an effective
range resolution of 1.5 m. With gallium arsenide lasers, pulse lengths
are of the order of 100 nanoseconds (7.5 m effective range resolution).
The technical problems of increasing these pulse lengths by substantial
amounts have no ready solutions, however.
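The relation between pulse length and range resolution can be sketched numerically. A pulse of duration τ returns signal simultaneously from a range interval cτ/2. The quoted "effective" figures (1.5 m for 20 ns, 7.5 m for 100 ns) equal cτ/4, half of that interval. Treating the effective resolution as cτ/4 is an assumption made here only to reproduce those figures; the report does not state the definition in this section.

```python
SPEED_OF_LIGHT_M_PER_S = 299_792_458.0

def range_interval_m(pulse_length_s: float) -> float:
    """Range interval c*tau/2 from which signals arrive simultaneously."""
    return SPEED_OF_LIGHT_M_PER_S * pulse_length_s / 2.0

def effective_resolution_m(pulse_length_s: float) -> float:
    """Assumed 'effective' resolution c*tau/4 (half the interval above)."""
    return range_interval_m(pulse_length_s) / 2.0

if __name__ == "__main__":
    for tau_ns in (20.0, 100.0):
        tau_s = tau_ns * 1e-9
        print(f"{tau_ns:.0f} ns: interval {range_interval_m(tau_s):.2f} m, "
              f"effective {effective_resolution_m(tau_s):.2f} m")
```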

Another technical problem of some difficulty is the dynamic
range involved. This is particularly acute with the single pulse
approach, for in strongly scattering atmospheres, energy scattered from
very close ranges can saturate the detector system. This system has
to be sensitive enough to handle very weak signals that are returned
from more distant cloud [note the effect of the inverse range squared
term in Eq. (B-1)], particularly after attenuation due to the turbidity
of the intervening path. The use of integration to achieve adequate
signal levels from multiple low-intensity pulses goes far to overcoming
the problem of overloading the receiver, for a single pulse return will
be small, even in strongly scattering atmospheres at close range. (If
considerable time is needed to achieve adequate signal levels, however,
this can result in a limitation of data rate.) The question of nearest
possible detection range is important. Because of dynamic range problems,
and also considerations of the geometry of the optical systems
involved that make it difficult to achieve adequate beam overlap at
short ranges, the minimum range at which cloud may be detected can be
restricted in any particular system. In typical research lidars of the
type used at SRI, this range is of the order of 50 to 100 m unless
special adjustments are made. The Sperry Rand GaAs lidar is designed
to make measurements as close as 30 m.

Thus, as will be seen, the design of a lidar capable of
acquiring the basic raw data for the determination of cloud information
is very much a question of selection and compromise. As demonstrated
with high-performance research equipment, virtually any information
required can be gathered at any one time. The problem lies in deciding
what a general-purpose practical system should be capable of doing.

C. Recovery of Cloud Information from Basic Data

The discussion of data acquisition above has mainly considered
the case of clearly defined cloud bases encountered above relatively
clear atmospheres. A lidar observation of such conditions is shown in
A-scope form (signal amplitude versus range) in Figure B-2. However, when
the cloud base is diffuse or low visual range conditions exist below the
cloud, the interpretation of the lidar signature is much more complicated
and becomes a sophisticated problem in pattern recognition. The same is
true where heavy rain or snow is falling. (Figure B-3 shows extreme cases
of this type.) The point is important, for although it may be relatively
easy for a human observer to interpret one or other of the forms of lidar
data display that are possible, the automatic determination of information
by an inanimate system may be surprisingly difficult.

FIGURE B-3 OSCILLOGRAMS OF RUBY LIDAR OBSERVATIONS OF CLOUD BASE.
(a) Low stratus with light drizzle and fog at the surface.
(b) Cumulonimbus with stratocumulus; intermittent very light rain, surface

For example, Figure B-4 shows an RHI type of display in which
data of the type shown in Figure B-2 are presented in a range-corrected
intensity-modulated form, derived from a series of shots made by
scanning the lidar in a vertical plane (Viezee, 1971). The nature of the
cloud cover, particularly the height of the cloud base, is apparent at a
glance. In contrast, the problems posed to a "black box," even in the
relatively straightforward conditions illustrated, are considerable.
(Note how the eye can readily perceive the base of the cloud at the left
of the section, even in the presence of considerable background noise.)
In conditions such as those of Figure B-3 the task is essentially
impossible. Even the relatively simple question of determining cloud base
height requires that criteria be established for distinguishing what
constitutes cloud base. As noted, this presents little difficulty when the
base is well defined. Such automatic ranging devices as have been applied
to this problem work very well in such circumstances. A timing device is
triggered at the firing of the pulse and stopped when a significant signal
is received from the cloud. The elapsed time may then be displayed or
reported in digital form with some precision (electronic timing devices
can readily provide the necessary accuracy, even at light velocities,
for very precise measurement of distance). The first problem arises if
the cloud base is diffuse. The returned signal increases as the pulse
penetrates into the cloud but is simultaneously reduced by attenuation.
In certain gradients of cloud density this will result in anything but
the well-defined, sharp-edged signatures that are appropriate for simple
timing devices. Again, fluctuating signals due to variations in the
turbid "clear" air below the cloud base can be confusing. The setting
of arbitrary criteria as to what constitutes "cloud base" can thus lead
to uncertainty and inaccuracy. (It should be remembered that the same
problem is present with conventional rotating beam ceilometers, and
that the current definition of "cloud base height" is the height that
the rotating beam ceilometer says it is.) Any simplified method of
assessing and reporting "cloud base" suffers the further disadvantage
of discarding some of the most valuable data that the lidar technique
of cloud observation can provide, namely, a report on the nature of
diffuse and ragged cloud bases, including information on the nature of
conditions in the underlying layer, particularly when multiple layers
are involved or where patches of diffuse cloud are present at various
levels.
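The elapsed-time ranging described above follows from the relation $R = c(t - t_0)/2$ given with the lidar equation. The sketch below shows the conversion in both directions, to illustrate the timing precision that a given range precision implies. The function names are illustrative, and the pulse is assumed to travel at the vacuum speed of light.

```python
SPEED_OF_LIGHT_M_PER_S = 299_792_458.0

def range_from_elapsed_time_m(elapsed_s: float) -> float:
    """Range R = c * (t - t0) / 2 for a round-trip elapsed time t - t0."""
    return SPEED_OF_LIGHT_M_PER_S * elapsed_s / 2.0

def timing_precision_s(range_precision_m: float) -> float:
    """Timing precision needed to resolve a given range increment."""
    return 2.0 * range_precision_m / SPEED_OF_LIGHT_M_PER_S

if __name__ == "__main__":
    print(f"2 microseconds elapsed: {range_from_elapsed_time_m(2e-6):.1f} m")
    print(f"1.5 m precision needs {timing_precision_s(1.5) * 1e9:.1f} ns timing")
```

A timer of this kind reports only the first significant return, so the difficulty with diffuse bases discussed above lies in deciding what counts as significant, not in the timing itself.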

Mention of the latter condition raises the question of the
spatial resolution required from a lidar ceilometer. Observations made
in a single direction such as the zenith will provide samples of
conditions at that point. Such samples can be made continuously or at
intervals determined by the engineering aspects of the system. How well such
observations can represent the general conditions, even overhead, will
depend upon the spatial and temporal variability of conditions and any
motion that is present. In principle, of course, the lidar technique
offers the possibility of scanning in two or three dimensions to derive
much more complete information on the cloud cover situation. In
practice, the engineering performance readily attainable will set a limit
to this approach, and it is unlikely that more than two cross sections
(derived at right angles) of the type shown in Figure B-4 could
be achieved. An attractive alternative would be to make a circular
scan at a fixed angle of, say, 60-70°. In this way conditions at each
height level would be sampled around the circumference of a circle, the
diameter of which would increase with height.
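The growth of the sampled circle with height can be illustrated with simple geometry. The text does not say whether the 60-70° is measured from the horizon or from the zenith. The sketch below assumes an elevation angle (measured from the horizon) and exposes the other reading through a flag. Both the function name and the flag are illustrative.

```python
import math

def scan_circle_diameter_ft(height_ft: float, angle_deg: float,
                            angle_is_elevation: bool = True) -> float:
    """Diameter of the circle traced at a given height by a conical scan.

    With an elevation angle the horizontal offset is h / tan(angle);
    with a zenith angle it is h * tan(angle).
    """
    angle = math.radians(angle_deg)
    if angle_is_elevation:
        horizontal_offset = height_ft / math.tan(angle)
    else:
        horizontal_offset = height_ft * math.tan(angle)
    return 2.0 * horizontal_offset

if __name__ == "__main__":
    for height in (1000.0, 3000.0):
        for elevation in (60.0, 70.0):
            d = scan_circle_diameter_ft(height, elevation)
            print(f"height {height:.0f} ft, elevation {elevation:.0f} deg: "
                  f"circle about {d:.0f} ft across")
```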

A further possibility is also available. If the beam is
scanned in any systematic manner (for example, in a conical or circular
scan or in one or more planes), the data acquired could be integrated
before interpretation and presentation. In this way average or
representative determinations of cloud-base heights, etc., could be more
readily derived than by considering the data in basic form. While this has
attractions, particularly where the acquisition of the raw data involves
integration of many pulses, the resulting information would have serious
deficiencies for many purposes for which cloud-base data are reported,
and would again fail to offer the real benefit of lidar-acquired data.

This aspect of the problem is closely related to the method
of presenting or reporting the information derived. Whether integration
is performed before or after interpretation, or whether sampling in
space and time is carried out, all involve the rate at which the raw
data are acquired, and this in turn affects the complexity and
sophistication required in the final display or reporting device. This is
particularly so if the maximum recovery of data or utilization of the raw
data is sought. For example, if it is sufficient to determine a single
value of cloud height when a clear-cut base exists, a very elegant
slow-response digital range-evaluating system can be achieved by discarding
all other data, which are redundant by definition. This is the course
followed in the Sperry Rand GaAs system, in which a traveling range
gate scans through the total effective range of 3,000 ft in 6 seconds.
The range at which significant signals occur in the gate (i.e., above
a predetermined threshold that varies with range) is thus readily
determined as the "cloud base height." More complex systems with broad-band
capability are needed to handle more complete information if
comprehensive information on cloud-base conditions is required, or if the
system is to produce accurate information in the many types of weather
condition encountered.
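The range-gate logic described above can be sketched as a search for the first gate position whose signal exceeds a range-dependent threshold. Everything numerical in the sketch is hypothetical: the gate spacing, the synthetic inverse-square signal levels, and the threshold law are chosen only to make the example run, and none of them describe the actual Sperry Rand design.

```python
from typing import Callable, Optional, Sequence

def sweep_rate_ft_per_s(total_range_ft: float = 3000.0,
                        sweep_time_s: float = 6.0) -> float:
    """Rate at which the gate travels through the total range."""
    return total_range_ft / sweep_time_s

def cloud_base_from_gates(gate_ranges_ft: Sequence[float],
                          gate_signals: Sequence[float],
                          threshold_at: Callable[[float], float]) -> Optional[float]:
    """Return the range of the first gate whose signal exceeds the
    range-dependent threshold, or None if no gate does."""
    for gate_range, signal in zip(gate_ranges_ft, gate_signals):
        if signal > threshold_at(gate_range):
            return gate_range
    return None

def synthetic_signal(range_ft: float, cloud_base_ft: float = 1500.0) -> float:
    """Hypothetical return: weak clear-air scatter below the base and a
    return 100 times stronger from cloud, both falling off as 1/R^2."""
    strength = 1.0e6 if range_ft >= cloud_base_ft else 1.0e4
    return strength / range_ft ** 2

def hypothetical_threshold(range_ft: float) -> float:
    """Hypothetical threshold that also falls off as 1/R^2, set between
    the clear-air and cloud levels above."""
    return 5.0e4 / range_ft ** 2

if __name__ == "__main__":
    ranges = [100.0 * k for k in range(1, 31)]        # gates at 100 to 3,000 ft
    signals = [synthetic_signal(r) for r in ranges]
    print("sweep rate:", sweep_rate_ft_per_s(), "ft/sec")
    print("reported base:", cloud_base_from_gates(ranges, signals, hypothetical_threshold))
```

A detector of this kind discards everything except the first crossing, which is the trade-off noted above: the report is simple, but diffuse or multiple layers are not described.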

In summary, as with the acquisition of the raw data, the
lidar technique offers very wide capability to make any type of
presentation required. However, the cost and complexity of achieving any
given degree of sophistication, especially if the system is to work on
an untended, automatic basis, mount rapidly as any but the simplest
ceiling measurement is attempted.

4. Safety

The question of safety has been alluded to above. Although the
matter has often been exaggerated, the fact remains that laser energy
improperly or injudiciously used can be hazardous to sight. Certainly,
giant pulsed systems operating in the visible and near-infrared
spectrum cannot be acceptable for automatic, untended use in the vicinity
of airports because of the high peak power of the energy they produce.
Although the transmitted beam is highly constrained in direction, there
is always the chance that it will be viewed from above, and the
possibility also exists that aluminum or other aircraft surfaces will reflect
the energy back to the surface. The most certain way of avoiding any
such risks is to operate either at sufficiently low peak powers as to
leave an adequate margin of safety in terms of generating energy
densities that can cause eye damage, or to operate well away from any
frequency that can penetrate the outer surface of the eye and thus be
focussed, however imperfectly, on the sensitive retinal surface, or
worse still, on the critical fundus (Wyman, 1969; Johnson, et al., 1965;
Vassiliadis, et al.).


5. Conclusion

The lidar technique offers considerable potential and flexibility
for the measurement of cloud base for operational purposes. However,
the early realization of this potential depends upon overcoming the
engineering and economic constraints imposed by the nature and complexity
of the current technology. In particular, eye safety considerations
indicate that low-peak-power systems should be used, but presently
available low-peak-power lasers cannot achieve adequate performance fully to
exploit lidar's capability for obtaining information over extended angles
by scanning, or for that matter for achieving adequate range of detection
in a fixed zenith-pointing system. Even if adequate performance can be
achieved with sufficiently high data acquisition rates, there remains
the problem of integrating and processing the data. In this (as, of
course, in other systems) the cost of sophisticated data processing could
be considerable.

These are yet early days, however, in the development of lidar
technology, and considerable progress can confidently be expected. The
promise for the lidar technique may thus be considered to be very real,
and exploration and definition of the expected requirements of a lidar
cloud-ceiling monitoring system appears to be wholly justified at the
present time, for no less reason than that a clearer understanding of the
requirement will undoubtedly stimulate the appropriate technical
development.


Appendix  C 


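letting $\Delta T$ approach zero. In this case we obtain

$$\sigma_{\min}^2 = \lim_{\Delta T \to 0} \sigma_m^2 = \frac{\sigma_a^2}{vT/\ell_m}\left[1 - \frac{1 - e^{-2vT/\ell_m}}{2vT/\ell_m}\right] \qquad \text{(C-11)}$$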


This is the smallest value of $\sigma_m^2$ that can be obtained merely by
sampling faster. If $\Delta T$ is not zero, $\sigma_m^2$ is greater than
$\sigma_{\min}^2$, but it may not be much greater. We can derive the
percentage increase in $\sigma_m^2$ from Eqs. (C-9) and (C-11) as a function
of $v\Delta T/\ell_m$ and $vT/\ell_m$. These results are plotted in
Figure C-1. Since the percentage increase is a monotonically decreasing
function of $vT/\ell_m$, it is bounded by the cases $T = \Delta T$
and $T = \infty$. This leads at once to the following bounds, which are also
shown on Figure C-1:


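$$\frac{\sigma_m^2 - \sigma_{\min}^2}{\sigma_{\min}^2} \le \frac{v\Delta T/\ell_m}{1 - \dfrac{1 - e^{-2v\Delta T/\ell_m}}{2v\Delta T/\ell_m}} - 1$$

$$\frac{\sigma_m^2 - \sigma_{\min}^2}{\sigma_{\min}^2} \ge \frac{v\Delta T}{\ell_m}\,\frac{1 + e^{-2v\Delta T/\ell_m}}{1 - e^{-2v\Delta T/\ell_m}} - 1$$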
Note that in the worst case $\sigma_m^2$ is less than 20 percent greater than
$\sigma_{\min}^2$ if $\Delta T = 0.25\,\ell_m/v$, and a further reduction in
$\Delta T$ does not yield much further benefit. Thus, for example, if the mean
cloud length is 200 ft and the cloud speed is 10 ft/sec, there is little value
in sampling faster than once every five seconds.
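The 20 percent figure and the five-second example can be checked numerically from the two bounding cases named in the text, $T = \Delta T$ (a single reading) and $T = \infty$. The sketch below assumes the exponential-correlation form of Eq. (C-11), in which the ratio $\sigma_{\min}^2/\sigma_a^2$ depends only on $vT/\ell_m$. The function names are illustrative.

```python
import math

def min_variance_ratio(vt_over_l: float) -> float:
    """sigma_min^2 / sigma_a^2 as a function of v*T/l_m, in the form
    assumed for Eq. (C-11)."""
    a = vt_over_l
    return (1.0 - (1.0 - math.exp(-2.0 * a)) / (2.0 * a)) / a

def increase_single_reading(x: float) -> float:
    """Fractional increase over sigma_min^2 when T = dT (one reading),
    where x = v*dT/l_m. This is the worst case."""
    return 1.0 / min_variance_ratio(x) - 1.0

def increase_long_average(x: float) -> float:
    """Fractional increase over sigma_min^2 as T goes to infinity."""
    e = math.exp(-2.0 * x)
    return x * (1.0 + e) / (1.0 - e) - 1.0

def sampling_interval_s(mean_cloud_length_ft: float, cloud_speed_ft_per_s: float,
                        fraction: float = 0.25) -> float:
    """Sampling interval dT = fraction * l_m / v."""
    return fraction * mean_cloud_length_ft / cloud_speed_ft_per_s

if __name__ == "__main__":
    x = 0.25
    print(f"worst case (T = dT):  {100 * increase_single_reading(x):.1f} percent")
    print(f"best case (T -> inf): {100 * increase_long_average(x):.1f} percent")
    print(f"interval for 200 ft clouds at 10 ft/sec: {sampling_interval_s(200.0, 10.0)} sec")
```

Under these assumptions the increase at $\Delta T = 0.25\,\ell_m/v$ lies between roughly 2 and 17 percent, which agrees with the "less than 20 percent" statement.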

In theory, the minimum expected squared error $\sigma_{\min}^2$ can be reduced
by increasing the averaging time $T$. In particular, if $T$ is large
relative to $\ell_m/v$, $\sigma_{\min}^2$ is approximately
$\sigma_a^2\,\ell_m/vT$, which approaches zero as $T$ approaches infinity.
In practice, one cannot allow $T$ to become too large without losing the
ability to respond to changes in $c_a$. Thus, one should probably select the
largest $T$ that is acceptable as far as response to systematic change in
$c_a$ is concerned, and see whether or not the resulting $\sigma_{\min}^2$
given by Eq. (C-11) is acceptable. If it is not, a single vertically-pointing
instrument will not be adequate, no matter how small $\Delta T$ is made.
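To illustrate the trade-off between averaging time and error, the sketch below evaluates the limiting variance and its large-$T$ approximation $\sigma_a^2\,\ell_m/vT$ for a few averaging times. It assumes the form of Eq. (C-11) used above, and it takes $\sigma_a^2 = 0.25$, the value for a half-covered sky; that value is an assumption for this example and is not a figure taken from this appendix.

```python
import math

def sigma_min_sq(avg_time_s: float, mean_cloud_length_ft: float,
                 cloud_speed_ft_per_s: float, sigma_a_sq: float = 0.25) -> float:
    """Limiting variance as dT -> 0, in the form assumed for Eq. (C-11)."""
    a = cloud_speed_ft_per_s * avg_time_s / mean_cloud_length_ft   # v*T/l_m
    return sigma_a_sq / a * (1.0 - (1.0 - math.exp(-2.0 * a)) / (2.0 * a))

def sigma_min_sq_large_t(avg_time_s: float, mean_cloud_length_ft: float,
                         cloud_speed_ft_per_s: float, sigma_a_sq: float = 0.25) -> float:
    """Large-T approximation sigma_a^2 * l_m / (v*T)."""
    return sigma_a_sq * mean_cloud_length_ft / (cloud_speed_ft_per_s * avg_time_s)

if __name__ == "__main__":
    for t in (60.0, 300.0, 600.0):
        exact = sigma_min_sq(t, 200.0, 10.0)
        approx = sigma_min_sq_large_t(t, 200.0, 10.0)
        print(f"T = {t:5.0f} sec: rms error {math.sqrt(exact):.3f} "
              f"(approximation {math.sqrt(approx):.3f})")
```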
Appendix D


ANALYSIS OF CLOUD-AMOUNT VARIANCE

1. Introduction

In this appendix the problem of estimating cloud amount is analyzed
for the case where the instruments are located along a straight line and
the clouds are moving very rapidly. The restriction of the instrument
locations reduces the problem to a one-dimensional analysis in which the
exponentially distributed clouds form a Poisson process.

The analysis proceeds as follows. First we establish the
statistical characteristics of the cloud amount $c(x)$ over a given point $x$.
Then we use this to obtain the autocorrelation function for the cloud
amount $\tilde{c}(x)$ over an airport centered at $x$, and the cross-correlation
function for $c(x)$ and $\tilde{c}(x)$. Finally, letting $\hat{c}$ be the
estimate of $\tilde{c}$ obtained by sampling at $n$ fixed points, we derive an
expression for the expected squared error in terms of these autocorrelation
and cross-correlation functions.

2. The One-Dimensional Case as a Poisson Process

Let $c(x)$ indicate the presence or absence of a cloud above a
given point $x$, with $c = 1$ if a cloud is present and $c = 0$ if a cloud is
absent. Then $c(x)$ is a discontinuous function, switching back and forth
from 0 to 1 at various transition points. We assume that these
transition points have a Poisson distribution; that is, we assume that the
probability of finding $n$ transition points in an interval of length $\ell$ is

$$P_n(\ell) = \frac{(\alpha\ell)^n}{n!}\,e^{-\alpha\ell} \qquad \text{(D-1)}$$

where $\alpha$ is the average number of transition points per unit interval.
If we assume further that transitions in nonoverlapping intervals
are statistically independent, then it is well known* that the
autocorrelation function for $c(x)$,

$$\varphi_{cc}(\zeta) = E[c(x)\,c(x + \zeta)] \qquad \text{(D-2)}$$

is given by

$$\varphi_{cc}(\zeta) = \tfrac{1}{4}\left(1 + e^{-2\alpha|\zeta|}\right) \qquad \text{(D-3)}$$

* Our result is a simple modification of that given by Lee (Lee, 1960).

This result has a simple interpretation. The independent Poisson
assumption leads to an exponential distribution of the lengths of the
intervals between transitions, with the mean interval being
$\ell_m = 1/\alpha$. Since a transition can be from cloudy to clear or from
clear to cloudy, this mean interval $\ell_m$ is both the mean cloud length
and the mean hole length. Thus, we are tacitly assuming that the large-area
cloud amount is 0.5, which is a worst case from the standpoint of estimating
cloud amount. If the correlation shift $\zeta$ is small relative to the mean
length $\ell_m$, $\varphi_{cc}$ approaches the mean squared value of $c(x)$,
0.5. If $\zeta$ is large relative to $\ell_m$, $\varphi_{cc}$ approaches the
square of the mean value of $c(x)$, $(0.5)^2$. In general, the correlation
between $c(x_1)$ and $c(x_2)$ is high if $|x_1 - x_2| \ll \ell_m/2$ and low
if $|x_1 - x_2| \gg \ell_m/2$.
3. Limited-Area Cloud Amount

Let $\tilde{c}(x)$ denote the cloud amount over an interval of width $w$
centered at $x$. Then $\tilde{c}$ is related to $c$ by

$$\tilde{c}(x) = \frac{1}{w}\int_{x - w/2}^{x + w/2} c(u)\,du \qquad \text{(D-4)}$$


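The autocorrelation function for $\tilde{c}$ is given by

$$\varphi_{\tilde{c}\tilde{c}}(\zeta) = E[\tilde{c}(x)\,\tilde{c}(x + \zeta)]
= \frac{1}{w^2}\int_{x - w/2}^{x + w/2}\int_{x + \zeta - w/2}^{x + \zeta + w/2} E[c(u)\,c(v)]\,dv\,du
= \frac{1}{w^2}\int_{x - w/2}^{x + w/2}\int_{x + \zeta - w/2}^{x + \zeta + w/2} \varphi_{cc}(u - v)\,dv\,du$$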
where $\varphi_{cc}$ is given by Eq. (D-3). Substituting for $\varphi_{cc}$ and
performing the integration, we obtain

Note that $\varphi_{\tilde{c}\tilde{c}}(\zeta) \to \varphi_{cc}(\zeta)$ as
$w \to 0$ and $\varphi_{\tilde{c}\tilde{c}}(\zeta) \to 1/4$ as $w \to \infty$.
These two extremes correspond to airports of zero and infinite extent,
respectively. As a special case of this result, we obtain the
following expression for the mean squared value of $\tilde{c}$:


$E[\tilde{c}^2] =$
4. The Cross-Correlation Function

The cross-correlation function between $c$ and $\tilde{c}$ is defined by

$$\varphi_{c\tilde{c}}(\zeta) = E[c(x)\,\tilde{c}(x + \zeta)] \qquad \text{(D-8)}$$

Using Eq. (D-4) for $\tilde{c}$ and Eq. (D-3) for $\varphi_{cc}$, we obtain
where $\varphi_{c\tilde{c}}$ is the cross-correlation function given by
Eq. (D-10). For the case of uniform sampling, we let


(D-14)


and


(D-15)


for $k = 1, \ldots, n$. With this choice, the estimate $\hat{c}$ is the average of
the instrument responses, and the instruments uniformly cover an interval
of width $w$ centered at $x = 0$. If we use this choice to estimate
$\tilde{c}(x)$ at $x = 0$, we obtain


(D-16)


Since the argument of $\varphi_{c\tilde{c}}$ ranges from greater than $-w/2$ to
less than $w/2$, it follows that we only need the first half of Eq. (D-10) for
$\varphi_{c\tilde{c}}$, corresponding to the strongly correlated case.
Substituting this expression, and using the relation
we obtain
where
We note in passing that if $n$ is very large, $\hat{c}$ becomes essentially the
same as $\tilde{c}$, and $E[\hat{c}\,\tilde{c}] \to E[\tilde{c}^2]$, given by
Eq. (D-7). This can be confirmed by computing the limit of
$E[\hat{c}\,\tilde{c}]$ as $n$ approaches infinity. At the other extreme, when
$n = 1$, $\hat{c} = c$ and $E[\hat{c}\,\tilde{c}] = \varphi_{c\tilde{c}}(0)$.
This can be confirmed by computing $\varphi_{c\tilde{c}}(0)$ from Eq. (D-10).
Our interest, of course, is in values of $n$ between these two extremes.
Eq. (D-2),
Using the uniform sampling assumptions of Eqs. (D-14) and (D-15) and
Eq. (D-3) for $\varphi_{cc}$, we obtain
Again using $\theta$ as defined above and using Eq. (D-17), after some
manipulation we obtain
6. Mean Squared Error

Eqs. (D-7), (D-18), and (D-22) give the three terms in Eq. (D-12)
for the mean squared estimation error. Combining these equations, we
obtain our final result:
This equation shows how the expected squared error depends on
the mean cloud length $\ell_m = 1/\alpha$, the size of the airport $w$, and the
number of instruments $n$. Although $\sigma^2$ is a relatively simple function
of $\alpha w$ and $n$, its computation requires a certain amount of care,
particularly when $w/\ell_m$ is small. By using a Taylor's series expansion
with $n$ fixed, one can obtain the following useful approximations:
FIGURE D-1 MEAN SQUARED ERROR FOR AN n-INSTRUMENT ESTIMATE OF CLOUD AMOUNT
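The dependence of the mean squared error on the number of instruments can be explored numerically without the closed-form result. The error $E[(\hat{c} - \tilde{c})^2]$ expands into the three terms $E[\hat{c}^2] - 2E[\hat{c}\,\tilde{c}] + E[\tilde{c}^2]$, each of which is an average of the autocorrelation $\tfrac{1}{4}(1 + e^{-2\alpha|\zeta|})$ over pairs of points. The sketch below makes several assumptions that are not taken from the report: the instruments are given equal weights and placed at the centers of $n$ equal sub-intervals of the airport, and the airport integral is approximated by a midpoint-rule grid. The function names are illustrative.

```python
import math

def phi_cc(zeta: float, alpha: float) -> float:
    """Autocorrelation of the 0/1 cloud indicator for Poisson transitions."""
    return 0.25 * (1.0 + math.exp(-2.0 * alpha * abs(zeta)))

def mean_squared_error(alpha: float, w: float, n: int, grid: int = 400) -> float:
    """Approximate E[(c_hat - c_tilde)^2] for n equally weighted instruments.

    Assumed layout: instrument k sits at the center of the k-th of n equal
    sub-intervals of [-w/2, w/2]. The area average c_tilde is approximated
    by the mean over `grid` midpoint samples of the same interval.
    """
    instruments = [(-0.5 + (k + 0.5) / n) * w for k in range(n)]
    area_points = [(-0.5 + (i + 0.5) / grid) * w for i in range(grid)]

    # E[c_hat^2]: average of phi_cc over all instrument pairs.
    e_hat_sq = sum(phi_cc(a - b, alpha)
                   for a in instruments for b in instruments) / n ** 2
    # E[c_hat * c_tilde]: average over instrument/area-point pairs.
    e_cross = sum(phi_cc(a - u, alpha)
                  for a in instruments for u in area_points) / (n * grid)
    # E[c_tilde^2]: average over all area-point pairs.
    e_area_sq = sum(phi_cc(u - v, alpha)
                    for u in area_points for v in area_points) / grid ** 2
    return e_hat_sq - 2.0 * e_cross + e_area_sq

if __name__ == "__main__":
    alpha = 1.0 / 200.0      # mean cloud length of 200 ft (example value)
    width = 1000.0           # airport width in ft (example value)
    for n in (1, 2, 4, 8, 16):
        mse = mean_squared_error(alpha, width, n)
        print(f"n = {n:2d}: rms error about {math.sqrt(mse):.3f}")
```

Because every term is built from the same positive-definite correlation function, the computed error is nonnegative up to rounding. It falls as instruments are added and tends toward zero as the airport width shrinks relative to the mean cloud length.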
Appendix E


EFFECT OF SAMPLING ON BASE-HEIGHT ESTIMATION


This appendix presents an analysis of the effect of sampling on
the estimate of the base height of a single unbroken cloud layer. The
mathematical steps are essentially the same as those encountered in
the cloud amount analysis given in Appendix C. Thus, the following
development is done with less detailed explanation.

Let the x-axis be aligned with the direction of cloud motion,
and let $h(x - vt)$ give the height of the cloud over point $x$ at time $t$.
Let the autocorrelation function* for the cloud base height be given by

$$E[h(x + \zeta)\,h(x)] = \bar{h}^2 + \sigma_b^2\,e^{-|\zeta|/d} \qquad \text{(E-1)}$$

where $\bar{h}$ is the mean base height, $\sigma_b$ is the base height standard
deviation, $\zeta$ is the correlation shift distance, and $d$ is the base-height
correlation distance. Let an instrument located at $x_0$ sample the
cloud layer at times $t_k = t_0 - k\Delta T$ to yield the readings
$h_k = h(x_0 - vt_0 + kv\Delta T)$.
Then the correlation between successive readings is given by

$$E[(h_j - \bar{h})(h_k - \bar{h})] = \sigma_b^2\,e^{-v\Delta T|j - k|/d} \qquad \text{(E-2)}$$
This correlation limits the advantage of using a very short
sampling interval $\Delta T$ to improve the estimate of the mean base height.
Let us estimate the mean base height by the mean of $m$ successive
readings,

$$\hat{h} = \frac{1}{m}\sum_{k=1}^{m} h_k \qquad \text{(E-3)}$$

Then $\hat{h}$ is a random variable with mean $\bar{h}$ and variance
$\sigma_{\hat{h}}^2$, where

$$\begin{aligned}
\sigma_{\hat{h}}^2 &= E[(\hat{h} - \bar{h})^2] \\
&= \frac{1}{m^2}\sum_{j=1}^{m}\sum_{k=1}^{m} E[(h_j - \bar{h})(h_k - \bar{h})] \\
&= \frac{\sigma_b^2}{m^2}\sum_{j=1}^{m}\sum_{k=1}^{m} e^{-v\Delta T|j - k|/d} \\
&= \frac{\sigma_b^2}{m(1 - \theta)}\left[1 + \theta - \frac{2\theta(1 - \theta^m)}{m(1 - \theta)}\right]
\end{aligned} \qquad \text{(E-4)}$$

where

$$\theta = e^{-v\Delta T/d} \qquad \text{(E-5)}$$

* This function is discussed in Appendix A of Duda, Mancuso, and
Blackmer (1970).

Clearly, Eq. (E-4) corresponds directly to Eq. (C-9), and Eq. (E-5)
corresponds directly to Eq. (C-10). From this, we can conclude
immediately that if we fix the averaging time $T = m\Delta T$ and let $m$
increase by letting $\Delta T$ go to zero, $\sigma_{\hat{h}}^2$ will approach
the minimum value

$$\sigma_{\min}^2 = \frac{\sigma_b^2}{vT/2d}\left[1 - \frac{1 - e^{-vT/d}}{vT/d}\right] \qquad \text{(E-6)}$$

Note that $\sigma_{\min}/\sigma_b$ can be obtained from Figure 2 by
replacing $vT/\ell$ by $vT/2d$. If $\Delta T$ is not zero,
$\sigma_{\hat{h}}^2$ is greater than $\sigma_{\min}^2$, but it may not be
much greater. The percentage increase in $\sigma_{\hat{h}}^2$ can be found
from Figure C-1 by replacing $v\,\Delta T/\ell$ by $v\,\Delta T/2d$. Thus,
in the worst case $\sigma_{\hat{h}}^2$ is less than 20 percent greater
than $\sigma_{\min}^2$ if $\Delta T \le 0.5\,d/v$, and a further reduction
in $\Delta T$ does not yield much further benefit. For example, if the
correlation distance is 100 ft and the cloud speed is 10 ft/sec, there is
little value in sampling faster than once every five seconds.
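
The 20 percent figure can be checked numerically. The following is a minimal sketch that evaluates Eq. (E-4) and Eq. (E-6) directly; the function names are illustrative and not from the report, and the brute-force double sum is included only as a cross-check on the closed form.

```python
import math

def variance_ratio(m, dt, v, d):
    """Eq. (E-4): variance of the m-reading mean, divided by sigma_b**2."""
    theta = math.exp(-v * dt / d)                      # Eq. (E-5)
    one_minus = 1.0 - theta
    bracket = 1.0 + theta - 2.0 * theta * (1.0 - theta**m) / (m * one_minus)
    return bracket / (m * one_minus)

def min_variance_ratio(T, v, d):
    """Eq. (E-6): limit of the same ratio as dt -> 0 with T = m*dt fixed."""
    x = v * T / d
    return (1.0 - (1.0 - math.exp(-x)) / x) / (x / 2.0)

def brute_force_ratio(m, dt, v, d):
    """Direct double sum over all reading pairs, used only as a cross-check."""
    total = sum(math.exp(-v * dt * abs(j - k) / d)
                for j in range(m) for k in range(m))
    return total / m**2

if __name__ == "__main__":
    v, d = 10.0, 100.0          # ft/sec and ft, the example in the text
    dt = 0.5 * d / v            # 5 sec sampling interval
    for m in (1, 2, 6, 60):
        T = m * dt
        excess = variance_ratio(m, dt, v, d) / min_variance_ratio(T, v, d) - 1.0
        print(f"m={m:3d}  T={T:6.1f} s  excess variance = {100 * excess:5.1f}%")
```

With $\Delta T = 0.5\,d/v$ the excess is largest for m = 1, where it is about 17 percent, and it shrinks as the averaging time grows, which is consistent with the bound quoted above.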
Appendix F


COMPUTATIONAL REQUIREMENTS FOR HIERARCHICAL CLUSTERING

In this appendix a simple analysis is made of the time required
for the modified hierarchical clustering procedure described in
Section IV-D-2. The following description is an elaboration of that
procedure written in a form that more closely resembles an actual
computer program. The input to the program is an array of N = mn base
height readings, m readings from each of n ceilometers, taken during an
averaging period T. As these readings are processed, the resulting
cluster centers are stored in the same array; the array index(i) is
used to indicate which entries are cluster centers. Using the
notation of Section IV-D-2, the procedure is as follows:


(1) Order the N readings $h_i$ so that $h_1 \le h_2 \le \dots \le h_N$.

(2) Initialize:

      $g \leftarrow N$; $\sigma_p^2 \leftarrow 0$;

      For i = 1 to N:

          $n_i \leftarrow 1$; $\hat{h}_i \leftarrow h_i$; index(i) $\leftarrow$ i.

(3) Find nearest clusters:

      $d_{\min}^2 \leftarrow$ very large number;

      For i = 1 to g-1:

          j $\leftarrow$ index(i);
          k $\leftarrow$ index(i+1);

          $d^2 \leftarrow \frac{n_j n_k}{n_j + n_k} (\hat{h}_j - \hat{h}_k)^2$;

          If $d^2 < d_{\min}^2$ then $d_{\min}^2 \leftarrow d^2$; $i_{\min} \leftarrow i$;
          else continue.

(4) Update:

      j $\leftarrow$ index($i_{\min}$);
      k $\leftarrow$ index($i_{\min}+1$);

      $\sigma_p^2 \leftarrow \sigma_p^2 + \frac{1}{N-1}\, d_{\min}^2$;

      $\hat{h}_j \leftarrow \frac{n_j \hat{h}_j + n_k \hat{h}_k}{n_j + n_k}$;

      $n_j \leftarrow n_j + n_k$; $g \leftarrow g - 1$;

      For i = $i_{\min}+1$ to g:

          index(i) $\leftarrow$ index(i+1).

(5) Test:

      If $\sigma_p^2 < (60\ \text{feet})^2$ and g > 1
      then go to Step 3.

(6) Results:

      Number of layers = g

      Mean base heights: $\hat{h}_{\text{index}(i)}$, i = 1 to g

      Cloud amount in layers: $n_{\text{index}(i)}/N$, i = 1 to g

      Pooled standard deviation: $\sigma_p$.
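
The steps above can be sketched in Python as follows. This is an illustrative sketch, not the report's program: the function name `cluster_layers` and the parameter `sigma_max` are hypothetical, and the index array is shortened with `del` rather than shifted element by element. It also makes one assumption that is not stated in the steps: it computes the pooled variance that a merge would produce and stops before committing a merge that would reach the 60-foot threshold, so that the reported layers always satisfy the limit. Applying the test after the merge, as the listing literally reads, would return the state one merge later.

```python
def cluster_layers(readings, sigma_max=60.0):
    """Merge adjacent clusters of sorted base heights; return layers and sigma_p."""
    h = sorted(readings)                       # Step 1: order the readings
    N = len(h)
    n = [1] * N                                # Step 2: cluster sizes
    mean = list(h)                             #         cluster centers
    index = list(range(N))                     #         pointers to live clusters
    g, var_p = N, 0.0
    while g > 1:
        # Step 3: find the adjacent pair with the smallest merge cost.
        d2_min, i_min = float("inf"), 0
        for i in range(g - 1):
            j, k = index[i], index[i + 1]
            d2 = n[j] * n[k] / (n[j] + n[k]) * (mean[j] - mean[k]) ** 2
            if d2 < d2_min:
                d2_min, i_min = d2, i
        # Assumed stopping rule: test the pooled variance before merging.
        new_var = var_p + d2_min / (N - 1)
        if new_var >= sigma_max ** 2:
            break
        # Step 4: merge cluster k into cluster j and close the gap in index.
        j, k = index[i_min], index[i_min + 1]
        var_p = new_var
        mean[j] = (n[j] * mean[j] + n[k] * mean[k]) / (n[j] + n[k])
        n[j] += n[k]
        del index[i_min + 1]
        g -= 1
    # Step 6: (mean base height, cloud amount) per layer, plus pooled std. dev.
    layers = [(mean[i], n[i] / N) for i in index]
    return layers, var_p ** 0.5
```

For the seven readings 990, 1000, 1010, 4980, 5000, 5010, and 5020 ft, this sketch returns two layers with mean bases of 1000 ft and 5002.5 ft, holding 3/7 and 4/7 of the readings respectively.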


Aside from the program itself, the primary storage requirements
are for the three arrays $n_i$, $\hat{h}_i$, and index(i), requiring 3N
words of core memory. The index array could be eliminated by more clever
programming, but this might require more execution time.

The most time-consuming part of this procedure is Step 3, since
it involves g-1 operations that may have to be repeated N-1 times,
with g going from N to 2. That is, the block of steps in the for-loop
may have to be executed N(N-1)/2 times. Including the incrementing
and testing required for the for-loop itself, this involves 6 fetches,
5 additions or subtractions, 2 multiplications or divisions, 2 tests,
and an average of 1 store each time. If $t_{add}$ is the add-time and
$t_{mul}$ the multiply-time for the computer, and each fetch, test, and
store is counted as one add-time, the total time required is bounded by

$$t_{tot} \le N^2 (7 t_{add} + t_{mul}).$$



Since this computation must be completed in time T if the processing
is not to fall behind the rate at which data is being received, and
since $N = mn = nT/\Delta T$, we obtain the requirement

$$7 t_{add} + t_{mul} \le \frac{\Delta T^2}{n^2 T}.$$
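
To get a feel for the numbers, the bound and the resulting requirement can be evaluated for a particular configuration. The sketch below is illustrative only: the function names are hypothetical, and the values of n, the sampling interval, the averaging period, and the machine times are assumed for the example rather than taken from the report.

```python
def step3_time_bound(n, dt, T, t_add, t_mul):
    """Upper bound N**2 * (7*t_add + t_mul) on the Step 3 time, with N = n*T/dt."""
    N = n * round(T / dt)          # m = T/dt readings from each of n ceilometers
    return N ** 2 * (7 * t_add + t_mul)

def max_operation_time(n, dt, T):
    """Largest allowed value of 7*t_add + t_mul, in seconds: dt**2 / (n**2 * T)."""
    return dt ** 2 / (n ** 2 * T)

if __name__ == "__main__":
    # Assumed example: 3 ceilometers, 5-sec sampling, 10-min averaging period,
    # and a machine with a 2-microsecond add and a 10-microsecond multiply.
    n, dt, T = 3, 5.0, 600.0
    t_add, t_mul = 2e-6, 10e-6
    print("allowed 7*t_add + t_mul :", max_operation_time(n, dt, T), "sec")
    print("actual  7*t_add + t_mul :", 7 * t_add + t_mul, "sec")
    print("bound on Step 3 time    :", step3_time_bound(n, dt, T, t_add, t_mul), "sec")
```

Because the allowed operation time scales as $\Delta T^2$, halving the sampling interval tightens the requirement on the computer by a factor of four.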